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Abstract. Let /C be a flag kernel on a homogeneous nilpotent Lie group G. We 
prove that operators T of the form T(f) = f*IC form an algebra under composition, 
and that such operators are bounded on L P (G) for 1 < p < oo. 
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This is the first of two papers dealing with singular integral operators with flag 
kernels on homogeneous nilpotent groups. Our goal is to show that these operators, 
along with appropriate sub-collections, form algebras under composition, and that 
the operators in question are bounded on LP . 

Operators of this kind arose initially when studying compositions of sub-elliptic 
operators on the Heisenberg group (such as the sub-Laplacian £ and with elliptic- 
type operators. In particular in [MRS95] one saw that operators of the form m(C, iT), 
(where m is a "Marcinkiewicz multiplier" ) are singular integrals with flag kernels and 
satisfy L p estimates. The theory was extended in jNRSOl] to encompass general flag 
kernels in the Euclidean space M. N , and the resulting operators arising via abelian 
convolution. In addition, aspects of the CR theory for quadratic manifolds could 
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be studied via such operators on various step-2 groups. More recently, flag kernels 
have been studied in |Yan09j and |DLM10j . In view of this, and because of their 
potential further application, it is desirable to extend the above results in |NRS01] 
to the setting of homogeneous groups of higher step. To achieve this goal requires 
however that we substantially recast the approach and techniques used previously, 
since these were essentially limited to the step 2 case. 

Our main results are two-fold. Suppose G is a homogeneous nilpotent group and 
fC denotes a distribution on G which is a flag kernel (the requisite definitions are 
given below in Definition 12. 3p . 

Theorem A. The operators T of the form T(f) = f * JC form an algebra under 
composition. 

Theorem B. The above operators are bounded on L P (G) for 1 < p < oo. 

Given the complexity of the material, in this introductory section we provide 
the reader with an outline of the main ideas that enter in the proofs of the above 
theorems. Moreover, in order to simplify the presentation we will often not state 
matters in the most general setting and sometimes describe the situation at hand a 
little imprecisely. 

1.1. Flag kernels. 

We start with a direct sum decomposition 1R = IR ai © • • • © IR a " , with X] j=i a j = 
N, and we write x = (x 1; x 2 , • • ■ , x n ), with x m e IR am . We also fix a one-parameter 
family of dilations 5 r on WL N , given by 5 r (x) = (r dl xi, . . . , r dn x n ), with positive 
exponents d\ < cZ 2 ■ ■ ■ < c^E We denote by Qk = d k a k the homogeneous dimension 
of M. ak . We also define the partial "norms" A^(x) = |xfc|y dfc where |xjt| e is the 
standard Euclidean norm on IR afc . 

In this setting, a flag kernel /C is a distribution on 1R which is given by integration 
against a C°° function A(x) away from x x = and which satisfies two types of 
conditions. The first are the differential inequalities for xi ^ 0: 

n 

\%K(x)\ < C a J[(iVi(x) + iV 2 (x)-- - + N k (x))- Q "- d ^" (1.1) 
k=i 

with a = (cci, . . . , a n ). The second are the cancellation conditions. These are most 
easily expressed recursively. Let (JC, cp) denote the action of the distribution K on a 
test function ip. At the beginning of the recursion there is the following condition, 
in many ways typical of the others: 

sup | (JC,<f R ) | < oo (1.2) 

R 

where R = (Ri,R 2 , ■ ■ -R n ), Pr( x ) — { p{R'\Xi ) R^x^ ■ • ■R'^x n ), and ip is an arbi- 
trary C°° function which is supported in the unit ball. More generally, one requires 



One can also allow non- isotropic dilations on each subspace W l . See Section [2] below. 
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that the action of /C on a test function in some subset of variables {x mi , . . . , x m/3 } pro- 
duces a flag kernel in the remaining variables {x^, . . . , x/ a }. The precise formulation 
of these conditions is given in Section [2] and Definition 12.31 below. 

1.2. Dyadic decomposition. 

A main tool used in studying flag kernels is their dyadic decomposition into sums 
of "bump functions". This proceeds as follows. Let / = (z'i,^--- ,i n ) denote any 
indexing set of integers that satisfies. 

% x < i 2 < • • • < i n _ x < i n . (1.3) 

Also let {f 1 } be a family of C°° functions supported in the unit ball that are uni- 
formly bounded in the norm for each m. Set 

[</]i(x) = 2" llQl ~ i2Q2 - ~ inQn /(2" a!lll x 1 , • ■ ■ 2~ dfel "x n ) , 

so that the [y? 7 ]/ are L 1 -normalized. We say that the ip 1 satisfy the "strong cancel- 
lation" condition if for each k with 1 < k < n, 

J ip I (x 1 ,...,x k ,...,ii n )dx k = (1.4) 

when all the inequalities ( 11. 3ft for / are strict. In the case that there are some 
equalities in ( II. 3ft . say < %i = — ■ ■ ■ — ik < ik+i, then only cancellation in 
the collection of corresponding variables is required: 

</(xi, . . . , >q, . . . , x fc , . . . , x n ) dyii ■ ■ ■ dx k = 0. (OI) 
The first result needed is that any sum 

made up of such bump functions, with cancellation condition (ll.4p and (11.41 ). con- 
verges in the sense of distributions to a flag kernel, and conversely, any flag kernel K, 
can be written in this way (of course, not uniquely). 

There are two parts to this result (which in effect is stated but not proved com- 
pletely in [NRSOlj ). The first is that the sum in ( II. 5p is indeed a flag kernel. To see 
this, one can use the estimate in Proposition 111.11 given in Appendix II below; one 
also notes from this that even without the cancellation conditions (II .4p and (11.41 ). 
the sum (11.51) satisfies the differential inequalities (11.11) . The converse part requires 
Theorem 16.11 below, and the observation that the parts of the sum (11.51) contributed 
by J's where there may be equality in (11.31) give flag distributions corresponding to 
various "coarser" flags. 

However, what will be key in what follows is that the strong cancellation conditions 
(jl.4j) or (11.41 ) can be weakened, and still lead to the same conclusion. While these 
"weak" cancellation conditions are somewhat complicated to state (see Definition 
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15.51 below), they are easily illustrated in the special step 2 case. Here we have the 
decomposition Mr = M ai © IR a2 , x = (xi,x 2 ). The cancellation condition for the 
second variable is as before: J <^ 7 (xi, x 2 )<ix2 = 0. For xi the weak cancellation 
condition takes the form 

J /(x^x^dxx = 2-^" ll )r / / (x 2 ), (1.6) 

for some e > 0, with / = (ii,^) and r] an L 1 normalized bump in the x 2 variable. 

In this context the main conclusion (Theorem 16.81) is that the sum ( 11.51) is still 
a flag kernel if the weak-cancellation conditions are assumed instead of ( 11.41) and 
( 11.41 ). and the functions {f 1 } are allowed to belong to the Schwartz class instead 
of being compactly supported. In understanding Definition 15.51 one should keep 
in mind that conditions like ( ll.4j) which involve vanishing of integrals are equivalent 
with expressions of the ip 1 as the sums of appropriate derivatives. (This is established 
in Lemma [5. ip . 

1.3. Other properties of flag kernels. 

Along with the results about decompositions of flag kernels, there are a number of 
other properties of these distributions that are worth mentioning and are discussed 
in Section [61 First, the class of flag kernels is invariant under the change of variables 
compatible with the structure of the flags. We have in mind transformations x h > y = 
F(x), with yk = X/t + -Pfe(x), and Pk a homogeneous polynomial of xi, . . . Xfe_i, of the 
same degree as x^. The fact that KoF satisfies the same differential inequalities (II. ip 
as /C is nearly obvious, but the requisite cancellation conditions (such as (11.21) ) are 
more subtle and involve the weak cancellation of the bump functions. (See Theorem 

Em) 

A second fact is that the cancellations required in the definition of a flag kernel 
can be relaxed. For example, assuming that the differential inequalities (11.11) hold, 
then the less restrictive version of (11.21) requires that the supremum is taken only 
over those R for which R\ > i? 2 • • • > R n > 0. The formulation and proof of the 
sufficiency of these restricted conditions is in Theorem 16. 131 

Finally we should point out that at the basis of many of our arguments is an 
earlier characterization in [NRSOlj of flag kernels in terms of their Fourier transforms: 
these are bounded multipliers that satisfy the dual differential inequalities given in 
Definition 16.31 

1.4. Graded groups and compositions of flag kernels. 

Up to this point our discussion of flag kernels has focused on their definition as 
distributions on the Euclidean space M . We now consider convolutions with flag 
kernels on graded nilpotent Lie groups G whose underlying space is M . The choice of 
an appropriate coordinate system on the group G, and its multiplication structure, 
induces a decomposition Mr = lR ai © • • • © lR a ™ and allows us to find exponents 
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di < d,2 ■ ■ • < d n as above so that the dilation 5 r (x) = (5 ri x 1; ■ • ■ , 5 r "x n ), 5 > 0, are 
automorphisms of G. 

The proof of Theorem A reduces to the statement that if K\ and K2 are a pair 
of flag kernels, then /Ci * /C2 is a sum of flag kernels, where the convolution is taken 
with respect to G. Note that when G is the abelian group R , the result follows 
immediately from the characterization of flag kernels in terms of their Fourier trans- 
forms, cited earlier, and in fact the convolution of two flag kernels is a single flag 
kernel. In the non-commutative case the proof is not as simple and proceeds as fol- 
lows. First write /Ci = Yli [f 1 ]^ ^2 = bP J ]j m terms of decompositions with 
bump functions with strong cancellation. Now formally 

K 1 *K i = x;xy]/ * [¥h- (1.7) 
/ j 

We look first at an individual term [y? 7 ]/ * [i> J ]j in the above sum. It has three 
properties: 

(a) [y? 7 ]/ * \& J \j = [0 i,j ]k with [6 i,j ]k a "bump" scaled according to K, where 
K = I V J; that is K — (fci, &2, • • • fc n ), and fc m = max(i m , j m ) 1 < m < n. This 
conclusion holds even if we do not assume the cancellation conditions on [<^ 7 ]/ 
and [ip J ]j. 

(b) Next, because we do have the cancellation conditions (jl.4j) and (11.41 ). we have a 
gain: There exists e > so that [6 i,j ]k can be written as a finite sum of terms 
of the form 

2-AH-h\ 2 -e [( im + 1_im )+(-? m + 1- -? m ) \9 I,J ] K 

leA meB 

where [9 i,j ]k is another bump function scaled according to K, and A and B are 
disjoint sets with AUB = {l,...,n}. 

(c) The strong cancellation fails in general for [9 i,j ]k, but weak cancellation holds. 
The statements (a), (b), and (c) above are contained in Lemmas 17.21 and 17.31 

With these assertions proved, one can proceed roughljH as follows. We define 9~k — 
[(p T ]i * [(p^]j, where the sum is taken over all pairs (J, J) for which / V J — K. 

IVJ=K 

Because of the exponential gain given in (b) this sum converges to a i^-scaled bump 
function. Moreover, because of (c), 9k satisfies the weak cancellation property. As a 
result, the sum ^2 k ^k converges to a flag kernel, and hence 1C\ * K2 is a flag kernel 
as was to be shown. 

We comment briefly on the arguments needed to establish (b) and (c). Here we 
use the strong cancellation properties of [y? 7 ]/ (or [ip J ]j). For (b) we express [y? 7 ]/ as 
a sum of derivatives with respect to appropriate coordinates, then re-express these 
in terms of left-invariant vector fields, and finally pass these differentiation to [ip J ] j. 



2 Thcre arc actually additional complications. Wc must first make a preliminary partition of the 
set of all pairs (7, J), and the result is that /Ci * IC2 is actually a finite sum of flag kernels. 
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The reverse may be done starting with cancellation of To obtain (c), the 

weak cancellation of [y? 7 ]/ * [^ J ]j) we begin the same way, but express [y? 7 ]/ in terms 
of right- invariant vector fields and then pass these differentiations on the resulting 
convolution products. The mechanism underlying this technique is set out in the 
various lemmas of Section [31 

The argument is a little more complex when we are in the case of equality for 
some of the indices's that arise in I or J. This in effect involves convolutions with 
kernels belonging to coarser flags. The guiding principle for convolutions of such 
bump functions (or kernels) is that if )Cj are flag kernels corresponding to the flag 
Tj, j — 1, 2, then /C x * /C 2 is a flag kernel for the flag J 7 which is the coarsest flag 
that is finer than Fi and J-" 2 . The combinatorics involved are illustrated by several 
examples given in Sections 18.31 and 18.51 

1.5. LP estimates via square functions. 

The proof of the LP estimates (Theorem ( I9.14p ) starts with the descending chain 
of sub-groups G — Gi D G2 Z> ■ ■ • D G n where 



when m > 2. We observe that the dilation's 5 r restrict to automorphisms of the G m . 
We then proceed as follows: 

(i) The standard (one-parameter) maximal functions and square functions on each 
group G m , as given in [FS], are then "lifted" (or "transferred") to the group G. 

(ii) Compositions of those lifted objects lead to (n-parameter) maximal functions 
and square functions on G. Among these is the "strong" maximal function 



for which one can prove vector- valued LP inequalities. Here R s = {x : |x^| < 
s^}, with (si, • • ■ , s n ) restricted to si < s 2 ■ ■ ■ < s k - There are also a pair of 
square functions, S and (3, with the property that 



/ ||i*< A p || S(f) \\ LP and || &{f) \\ LP < A' || / \\ LV , for 1< p < 00 . (1.8) 



(iii) The connection of these square functions with our operators T, given by Tf = 
f * K. with K. a flag kernel, comes about because of the point-wise estimate: 



which is Lemma [9.131 below. 

Now (11. 8p together with (11. 9p prove the LP boundedness of our operators. 

Among the ideas used to prove (II. 9p is the notion of a "truncated" flag kernel: 
such a kernel is truncated at "width a", a > 0, if it satisfies the conditions such as 
( ll.ip . but with Ni + • • • + N k replaced by a + N 1 + - ■ - + N k throughout, (see Definition 
17.51) . A key fact that is exploited is that a convolution of a bump of width b with a 



G m = {x = (xi, x 2 , ■ ■ • x„) : xi = 0, x 2 = , . . . , 



x m-l — 0}, 




S(Tf)(x)<c&(f)(x), 



(1.9) 
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truncated kernel of width a yields a truncated kernel of width a + b. For this, see 
Theorem 19.7} and its consequence, Theorem 19.91 

1.6. Final remarks. 

The collection of operators with flag kernels contains both the automorphic (non- 
isotropic) Calderon-Zygmund operators as well as the usual isotropic Calderon- 
Zygmund operators with kernels of compact support, (broadly speaking, the standard 
pseudo-differential operators of order 0). But flag kernels, by their definition, may 
have singularities away from the origin. Thus the algebra we are considering consists 
of operators that are not necessarily pseudo-local. The study of a narrower alge- 
bra that arises naturally, which consists of pseudo-local operators and yet contains 
both types of Calderon-Zygmund operators, will be the subject of the second paper 
|NRSVV11] in this series. 

The authors are grateful to Brian Street for conversations and suggestions about 
the decomposition of flag kernels into sums of dilates of compactly supported func- 
tions. We would also like to thank the referee for a very careful reading of the paper. 
We note that the topic of this paper was the subject of several lectures given by 
one of us (EMS), in particular at a conference in honor of F. Treves at Rutgers, 
April, 2005, and at Washington University and U.C.L.A in April and October 2008. 
During the preparation of this paper we learned of the work of Glowacki |Glol0aj . 
|Glol0bj . and |Glo01] where overlapping results are obtained by different methods. 
We should also mention a forthcoming paper of Brian Street |Strl0j that deals with 
the L 2 -theory in a more general context than is done in the present paper. 



2. Dilations and flag kernels on 



Throughout this paper we shall use standard multi-index notation. Z denotes the 
set of integers and N denotes the set of non-negative integers. If a = (ai, . . . , a at) G 
N N , then |a| = «i + • — h and a\ — a.\\ ■ ■ ■ ajyl. If x = (x±, . . . , x^) G R N , then 
x a = x" 1 ■ ■ -x^ . For 1 < j < N, d Xj (or more simply dj) denotes the differential 
operator If a G then d a denotes the partial differential operator d® 1 ■ ■ ■ d^ N . 

The space of infinitely differentiable real-valued functions on R with compact 
support is denoted by Q°(R ) and the space of Schwartz functions is denoted by 
<S(R ). The basic semi- norms on these spaces are defined as follows: 



if cp eC™{R N ) 



|MI(m) = sup < 


>>(x)| 


\\<p\\[M] = sup ' 


1(1 + Me 



\a\ <m, xER N 



if <p G S(R N ), 

Here |x| e denotes the usual Euclidean length of x G R N 



< M, x G K^} 



2.1. The basic family of dilations. 
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Fix positive real numbers < d\ < d 2 < ■ ■ ■ < d N , and define a one-parameter 
family of dilations on R N by setting 

5 r [x] = r-x= (r dl xi,. . . ,r dN x N ). (2.1) 

Also fix a smooth homogeneous norm |x| on R N so that \r ■ x| = r |x|. The homo- 
geneous ball of radius r is B(r) = {x 6 | |x| < r}, and the homogeneous dimen- 
sion of R N (relative to this family of dilations) is Q = d\ + • • • + d^. Recall that 
|x| e = \J x\ + • • • + x 2 N denotes the ordinary Euclidean length of a vector x £ R N . 
If m(x) = cx a = ex" 1 •••a^y v is a monomial, then m(r • x) = r aidl+ "' ajvdjv m(x), 
and the homogeneous degree of m is A(m) = aidi + ■ ■ -aijvdjv- In particular, the 
homogeneous degree of a constant is zero. We shall agree that if the homogeneous 
degree of a monomial is negative, the monomial itself must be identically zero. With 
this convention, if m is any monomial, we have 

A(djm) = A(m) - dj. (2.2) 

We denote by Wd the space of real- valued polynomials which are sums of monomials 
of homogeneous degree d. We have the following easy result. 

Proposition 2.1. If P is a polynomial, then P £ Hd if and only if P(x) = 



Moreover: 



Y^aesia c « x ° where fi d = ja = (cm, . . . , a N ) £ N n Y^jLi a j d j = d } 

(1) if P eU d then P(r ■ x) = r d P(x); 

(2) if P £ H dl and Q £ U d , z , then PQ £ T-i dl +d 2 ; 

(3) ifPeUd then c\.(P)(x) = if d k > d. 

2.2. Standard flags and flag kernels in M^. 

If X is an ^-dimensional vector space, an n-step flag in X is a collection of 
subspaces Xj Q X , 1 < j < n, such that (0) C X : C X 2 C • • • C X n _i C X n = X. 
When X = IR^ we single out a special class of standard flags parameterized by 
partitions N = a\ + • • ■ + a n where each Oj is a positive integer. We write 

R N = R a i ©•••©R a ™, (2.3) 

and we write x £ M. N as x = (x 1; . . . , x n ) with x^ £ M. a i . With an abuse of notation, 
we identify M. ak with vectors in M. N of the form (0, . . . , 0, x^, 0, . . . , 0). Then the 
standard flag J 7 associated to the partition N — a\+- ■ -+a n and to the decomposition 
(12. 3p is given by 

(0) C R an C M""- 1 © R an C ■ • • C R a2 © • • • © R an C R ai © • • • © R a " = R N . (2.4) 

In dealing with such decompositions and flags, it is important to make clear which 
variables in 1R appear in which factor R ai . We can write x £ R N either as x = 
(xi, . . . , xn) with each Xj £ R, or as x = (xi, . . . , x n ) with x^ = (x Pl , . . . ,x qi ) £ R ai 
so that qi = p\ + a\ — 1. Denote by J[ = {pi,pi + 1, . . . , qi} the set of subscripts 
corresponding to the factor R ai so that {!,..., N} is the disjoint union J\ U • • ■ U J n . 
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There is a mapping 7r : {1, . . . , N} \-t {1, . . . , n} so that j G J^y) for 1 < j < N. 
Thus for example 7r(10) = 3 means that the variable Xio belongs to the factor W 13 . 

With the family of dilations defined in ( 12. ip . the action on the subspace IR a ' is 
given by 

r -x; = {r d »x Pv ...,r d "ix qi ). (2.5) 
The homogeneous dimension of M. a ' is 

Qi = d Pl H h d gi = ^ dj- (2.6) 

jeJi 

The function 

= sup Ixsl 1 ^ (2.7) 

is a homogeneous norm on M. ai so that N[(r -x ; ) = r A^(x^). If a = («i, . . . , ajy) £ N^, 
let a; = (ojpp . . • , oc qi ), and set 

[a t ] = a Pl d Pl + ■■■ + a qi d qi = ^ oijdj. (2.8) 

We can introduce a partial order on the set of all standard flags on M. N . 

Definition 2.2. Let A = (ai, . . . , a r ) and £> = . . . , b s ) be two partitions of N so 
that N = ai + ■ ■ • + a r = b\ + ■ ■ ■ + b s . 

(1) The partition A is finer than the partition B, (or B is coarser than A), if there 
are integers {1 = a\ < «2 < • • • < a s +i = r + 1) so that bk = Y^jSx k 1 a r We 
write A z< B or B h A. If A < B but A ^ B we write A -< B or B y A. 

(2) If Tj± an d Fb are the flags corresponding to the two partitions and if A ~2 B (or 
A -< B), we say that the flag J 7 ^ is finer than J-g (or J-g is coarser than J 7 ^) 
and we also write d Fb an d Fb J^a ( or J^A Fb and Tb y Fa)- 

We recall from [NRSOlj the concept of a flag kernel on the vector space 1R associ- 
ated to the decomposition M ai ©• • -©M a " , equipped with the family of dilations given 
in equation (12.11) . Let T be the standard flag given in (12 .4p . In order to formulate 
the cancellation conditions on the flag kernel, we need notation which allows us to 
split the variables {x 1; . . . , x n } into two disjoint sets. Thus if L = . . . , l a } and 
M = {mi, . . . , mp} are complementary subsets of {1, . . . , n} (so that a + (3 = n), 
let N a = + ■ ■ ■ + ai a and N b = a mi + • • ■ + a m „. Write x G WL N as x = (x', x") 
where x' = (x^, . . . , x; Q ) and x" = (x mi , . . . , x m „). If / is a function on IR^" and g 
is a function on IR^ 6 , define a function / ® g on Mr by setting 

f®g(x. x ,...,Xn) = f(x h , . . . ,x la )g(x mi , . . . ,x m/3 ). 

Definition 2.3. A flag kernel adapted to the flag J 7 is a distribution JC G S'(R N ) 
which satisfies the following differential inequalities (part (jaj) ) and cancellation con- 
ditions (part (jb]) ). 
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(a) For test functions supported away from the subspace Xi = 0, the distribution 
/C is given by integration against a C°° -function K . Moreover for every a = 
. . . , ckjv) G %> N there is a constant C a so that if dk = (o! Pk , ■ ■ ■ , a q k )> then for 

Xl 



\d a K(x)\ < C a n^i(xi) + ---iV fc (x fc )]- 



k=l 



(b) Let {l,...,n} = LUM with L = {k, . . .,l a }, M = {m 1 , . . .,mp} andLnM = 
be any pair of complementary subsets. For any ip G C^(M. Nb ) and any positive real 
numbers R u ...,Rp, put ij; R (x mi , . . . ,x m/3 ) = • x mi , . . . , Rp ■ x m0 ). Define 

a distribution )Ct R G iS'(IR ai i + "' +a! <') by setting 

for any test function <p G 5(IR ai i + ' +a '' ). Then the distribution /C* R satisfies the 
differential inequalities of part (jaj) for the decomposition M. a 'i © • ■ ■ © Mr lr . More- 
over, the corresponding constants that appear in these differential inequalities are 
independent of the parameters {R\, . . . , R s }, and depend only on the constants 
{C a } from part (jaj) and the semi-norms of ip. 

The constants {C a } in part (jaj) and the implicit constants in part (jb]) are called the 
flag kernel constants for the flag kernel K. 

Remarks 2.4. 

(a) This definition proceeds by induction on the length n of the flag. The case 
n — 1 corresponds to Calderon-Zygmund kernels, and the inductive definition is 
invoked in part (jb]). 

(b) With an abuse of notation, the distribution /C* R is often written 

k *r( x Ii> ■ ■ ■ > X 'J = J '"J ^( x )^i?(x mi ,...,x m Jc/x mi ---c/x ms . 



3. Homogeneous vector fields 



In Section ITTfl below, where we consider a nilpotent Lie group G whose underlying 
space is K , we will need to consider the families of left- and right-invariant vector 
fields on G. At this stage, before we introduce the group structure, we consider 
instead two spanning sets of vector fields {Xi, . . . , X^} and {Yi, . . . Y^} on Mr which 
are homogeneous with respect to the basic family of dilations given in (12.11) ; this 
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means that if Zj is either Xj or Yj for 1 < j < N, then Zj can be writtenEI 

Zj[z/,](x) = c^#](x) + ^j(x)«](x), (3.1) 

di>dj 

with Pj £ H dl -d r It follows from part of Proposition O that d k (Pj) = 
if dk > d\. Thus we can commute the operators given by multiplication by the 
polynomial P- and differentiation with respect to xi and also write 

N 

Z,#](x) = fi,M(x) + di[P l M*)- (3-2) 
It follows from or (Q that Z w = <9at. 

Proposition 3.1. If P £ "Hd and Zj zs either Xj or Yj, then Zj[P] £ T-id-d , and if 
dj > d, Zj[P] = 0. 

Proo/. It follows from (Q that if P e H d , then <9,[P] £ and since Pj £ 

Hdt-dj, it follows from part ([2]) of Proposition 12.11 that Pj d Xl [P] £ Hd-d r Thus 
[P] £ Hd-d ■ The last conclusion then follows from part ([3]) of Proposition 12.11 

□ 

In equations ( 13.1 ft or ( 13.21) . the vector fields {Zj} are written in terms of the 
Euclidean derivatives. Because these equations are in upper-triangular form, it is 
easy to solve for the Euclidean derivatives in terms of the vector fields. 

Proposition 3.2. For each 1 < j < N let Zj denote either Xj or Yj. Then there 
are polynomials Q l k £ l-idi-d k such that for ip £ 5(18^)., 

N N 

^M(x) = z fc M + E Ql(x)ZH(x) = z*M(x) + E z iW\{*)- 

dl>d k d t >d k 

Proof. We argue by reverse induction on the index k. When k = N it follows from 
equation (13.11) that = Zn = X^ = Y^. To establish the induction step, suppose 
that the conclusion of the Proposition is true for all indices greater than k. From 
equation (13. ip and the induction hypothesis, for either choice of Z k we have 

N N N 

d m >d k d m >d k di>d m 

N 



Zi\ 



= ZM~ E P kZm[^\- E [ E 

dm>d k di>d k d k <d, n <d t 

But according to part of Proposition 12.11 P™ Q l m G Hd t -d k , and this completes 
the proof. □ 



3 Dcspitc sonic risk of confusion, we do not introduce different notation for the coefficients of Xj 



and Yj. 
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For i/j £ «S(R JV ) and t > set ^f(x) = ^(t _1 ■ x). Then multiplication by 
a polynomial P £ "H^ is an operator homogeneous of degree in the sense that 
P(x)^ 4 (x) = t d (P(/)) 4 (x), and the vector fields Xj and Yj- are operators homogeneous 
of degree — dj in the sense that Xj[(p t ](x) = t~ dj (X,<^)i(x), Yj-[<^](x) = t~ dj (Yj(p) t (x.). 
In particular, the commutators [X^X^.] and [Y^Y^] are vector fields which are ho- 
mogeneous of degree —(dj + dk)- It follows that we can write 

[Xj, x k ] = Yl ^(x) d m = Yl R M*) z ™ with Q?*> R T,k e , 

d m >dj+d k dm>dj+dh 

[Yj, Y k ] = <fe( x ) ^ = E ^( x ) z - with Q&> e Hi^-d,-*. 

d m >dj+d k d m >dj+d k 

If the operators {X,} and are bases for a Lie algebra (as in the case of left- or 
right-invariant vector fields), the coefficients {R™f,} and {R^k} are constants. 

Equations (13. ip or (13. 2 p express the vector fields {Z k } in terms of the standard 
derivatives {dj}, and Proposition 13.21 expresses the standard derivatives in terms 
of the vector fields. We shall need analogous identities for products of r vector 
fields Z kl ■ ■ ■ Z kr or products of r Euclidean derivatives d kl ■ ■ ■ dk r ■ The formulas are 
somewhat complicated, since they involve products of operators of various lengths. 
To help with the formulation of the results, it will be convenient to introduce the 
following notation. 

Definition 3.3. Let k\,...,k r £ {1,...,N} be a set of r integers, possibly with 
repetitions. 

(1) For any non-empty set U C {1, . . . , r}, put 

du — /;d ke , and 

i&J 



3(U) = {me{l,...,N}\d m >d u } 



Note that if U consists of two or more elements and m £ 3(U), then d m > 
sap ieU d kt . 

(2) For each integer 1 < s < r letU r s denote the set of partitions of the set {1, . . . ,r} 
into s non-empty disjoint subsets U = {U\, . . . , U s }. 

The following Proposition then shows how to write products of vector fields in terms 
of products of Euclidean derivatives. 

Proposition 3.4. Let k\,...,k r £ {1, . . . , N} be a set of r integers, possibly with 
repetitions. For 1 < £ < r, let Z kl denote either X ke or Yk r Then there are polyno- 
mials P™* £ T-Ld me -d Ul such that 



z kl ---ZkM = Yl E E ••• E d mi ---d ma [p^---p^]. 

s=l (Ui,-,U e )GAl mieJ{Ui) m s £3(U s ) 
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If s = r, so that Uj = {%} , the polynomial P™ J (x) = 1. 



Proof. We argue by induction on r. The case r = 1 is contained in equation (13. 2p . 
so suppose we are given vector fields {Z kl , . . . , Z kr+1 } where each Z kl is either X ke or 
Y kr Then by induction 

Zki ■ ■ -Zkr+i VP] = z kl ■ ■ ■ z kr [z kr+l [tp]] 

= E E E •■■ E d mi ---d ms [p^...p™;[z kr+ M)}- 

s=l (Ui,...,U s )eU; mi€3(Ui) m s £3(U B ) 

(3.3) 

Since we can write Z kr+1 [ip] = d kr+1 [tp] + E{ m | dm >d fcH . 1 } d ™[P%. + ^} where P fe ™ +1 e 

U dm - dkr+i , the derivative d mi ■ ■ ■ d ms [P™ 1 ■ ■ ■ P™° [^JV]]] m the last line of Q 
can be written 



d mi ■ ■ ■ d ma [P^ 1 ■ ■ ■ P™ s s [Z kr+1 

= d mi ---d ma [P^---P™;[d kr+ M}} 



d m >d kr+1 



d m i---d m Ar + i[ P Ui--- p uM 

- d mi ■••d ma [d kr+1 [P^ 1 ■ ■ ■ P { 



o!m>a! fcr+1 

a!m>a!fc r+1 

The terms in the first and third lines of the last expression have the right form 
for the case r + 1. Thus in the term d mi ■ ■ ■ d ms d kr+1 [P^ 1 • • • Pu a a ' l P\ , r has been 
replaced by r + 1, the set {1, . . . , r, r + 1} has been decomposed into s + 1 subsets 
{Ui, . . . , U s , U s+ i} where U s+ \ = {k r+ i}, and P^ r+1 (x) = 1. The same is true for 
each term d mi ■ ■ ■ d ms d m [P™ 1 • • • P^P^M , except that P^ +i = P£ +1 . 

For terms in the second and fourth lines, we use the product rule; we write 
<9fc r +i \Pjji ' ' ' p u s s ] an d d m \Pjji ' ' ' p u s s ] as a sum °f s t erms - Consider, for example, 

d mi ---d ms [d m [P^}P^---P™;P^] 



dmi---d m \( d m [P^]PZ +1 )P^---Pu a 

dm>d k 



K r+1 



dmi---d ma [P^P^---P^] 



FLAG KERNELS ON HOMOGENEOUS GROUPS 14 

where P~ n = V, d^P^FT e H d d , . These terms also have the 

right form for the case r+1, since we now let Ui = UiL){k r+ i} so that {ki, . . . , k r+ {\ = 
XJ\ U U-i U ■ ■ • U U s . This establishes the Proposition. □ 

The next result shows how to write products of Euclidean derivatives in terms of 
products of vector fields. Since the proof is similar to that of Proposition I3.4[ we 
omit it. 

Proposition 3.5. Let ki,...,k r G {1,...,N} be a set of r integers, possibly with 
repetitions, chosen from the set {1, . . . , N}. There are polynomials Q™* G Hd me -d Ue 
such that 

r 

d kl -'-d k M=Yl E E E Z mi ■ ■ ■ ZmAQvl ■■■Qu^l 

s=l (Ui,...,U a )eU; mie0f(J7i) m a £3(U a ) 

Here each Zj is either Xj or Y y If s = r, so that Ui = {ki}, the polynomial 

q™;(x) = i. 

4. Normalized bump functions and their dilations 

4.1. Families of dilations. 

Fix the family of dilations on M. given in equation (12.11) . We introduce an N- 
parameter family of dyadicj^ dilations. For / G L 1 (IR Ar ) and / G 7j N set 

2 1 ■ x = (2 _dlil xi, . . . , 2~ dNiN x N ), 

& (4-1) 
[/]j(x) = 2 h /(2 J -x). 

Then 1 1 [/]/| | L i( R jv) = ll/IU 1 ^)- The set of monotone increasing indices is denoted 
by 

E N = {/ = . . . , i N ) G Z N | h < i 2 < ■ ■ ■ < i N } . (4.2) 

When we consider flag kernels corresponding to the decomposition 21 given by N = 
cl\ + • ■ ■ + a n and M. N = M. ai © ■ ■ • © M a ™ , we consider the n-parameter family of 
dilations parameterized by n-tuples I = . . . , i n ) G Z": 

2 1 ■ x = (2 h ■ xi, . . . , 2 in ■ x„), where 



2* • x, = (2-^x pv 2- d «* k x qe ), and 

n 

[/]/(x) = 2 h /(2^-x). 



(4.3) 



The set of monotone increasing indices in this case is denoted by 

£ n = {I = {n,. . .,i n ) ez n \n <i 2 < ■ ■ ■ <i n ) . (4.4) 



4 In Section IH1 we shall use a continuous version of this dyadic family. If t = (fx,... , tjv) with 
each tj > 0, we will set / t (x) = f(t^xi,. . . ^^xn)- 
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Given the decomposition 21, there is a mapping p a : S n (->• E N given by 




P%(k, ■ ■ ■ , i n ) = ( k, ■ ■ ■ , k , *2, • • • , k , ■ ■ ■ , in, ■ ■ ■ , i n ) ■ 



(4.5) 



We shall want to write flag kernels as sums of dilates [tp]j of normalized bump 
functions tp. Roughly speaking, a family of functions {tp a } in Cq°(R ) or <S(R ) is 
normalized if one has uniform control of the supports (in the case of C£°(R )) and 
of the semi-norms ||<^ a ||( m ) or ||</J a ||[M]- The following definition will simplify the 
precise statements of our results. 

Definition 4.1. 

(1) If ip, (f G C(f(lR n ) ; then tp is normalized in terms of ip z/@ there are constants 
C, C m > and integers p m > so that: 

(a) If the support of ip is contained in the ball B(p), then the support of tp is 
contained in the ball B(Cp). 

(b) For every non-negative integer m, \\<p\\(m) < C m \ \tp\ \( m + Pm ) ■ 

(2) If ip, ip G 5(lR n ) ; then ip is normalized in terms of ip if there are constants 

Cn > and integers pn > so that \\ip\\[N] < Cn \\ip\\[N+ PN } for every non- 
negative integer N . 

(3) If P, P are polynomials, then P is normalized in terms of P if P is obtained from 
P by multiplying each coefficient by a constant of modulus less than or equal to 



If tp g C^°(R Ar ), it is sometimes convenient to write tp as a sum of products of 
functions of a single variable. That this is possible follows from the following fact. 

Proposition 4.2. Let tp e C£°(R N ). Then for each a E N N and 1 < k < N there 
are functions tp a ^ G C^°(R ) so that 



where for any M > 0, there is a constant Cm such that \c a \ < Cm (1 + \ot\) M . 

Proposition 14.21 follows easily by regarding tp as a periodic function in each variable, 
and then expanding tp in a rapidly converging Fourier series. 



We shall sometimes use the expressions 'normalized with respect to' or 'normalized relative to' 
as a variant of 'normalized in terms of. 



1. 
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4.2. Differentiation and multiplication of dilates of bump functions. 

In this section we study the action of differentiation or multiplication by a homo- 
geneous polynomial on dilates of bump functions. The key results are Proposition 
14.71 and Corollary 14.81 below. We begin with the following result which follows easily 
from the definitions and the chain rule: 

Proposition 4.3. Let ip G S(R N ). Then 

2 +dktk <W]/(x) = and 2~ dk * k x k [^/(x) = [x k V>]j. (4.6) 

More generally, if I = . . . , z'at) £ 1^ , and a G N N , then 

2+[* J l 9°^]/ = [d a ^]j and 2- [q - /1 x q [^] / = [x>] J5 (4.7) 

where [«■/[ = J]f=i a k d k i k . 

We will frequently use the following generalization of the second identity in equa- 
tion dUTJ). 

Proposition 4.4. Let P G ~Hd where d < d\. If I G -E^, there is a polynomial Pi G 
T-Ld, normalized in terms of P, so that P(x)[^]/(x) = 2 d ' H [Pjr-0] (x) /or ^ G iS(M Ar ). 

Proo/. Write P(x) = Y. a ^ d c a x " witn = i a = ( a i> ■ ■ ■ > «jv) G | otidi H h 

a^dx = d}. Since d < di, if a G fid we have ccj = for j > I. According to 
Proposition [OJ it follows that P(x)[^]/(x) = E aG ^ d ^2^ [x^J/x), and 

2-1 2-1 2-i 

Jo • /J ^ Qrn,drrj"m H ^ ^ ^ra^tn ^ ^ C^mdmij'l ^m) 
m=l m=l m=l 

2-1 



III - I, 



m=l 



Thus 



P(x)[-0]/(x) = 2 di < [ ^ c^ELii^Mx^ 



^-l 



But since / G £tv, each exponent — ^2 m=1 ot m d m {%i — i m ) < 0. The proof is complete 
if we set P/(x) = £ ae8 c a 2^^^ amdm ^- im ^x. a . □ 

Remark: Equation (14.61) shows that the operator 2 dk%k d Xk applied to the /-dilate 
of a normalized function tp is the /-dilate of a function tp normalized in terms of tp, 
and multiplying the /-dilate of normalized function tp by 2~ dk%k x k is the /-dilate of 
a function tp normalized in terms of tp. Thus at 'scale /', the operators 2 dklk d Xk and 
2~ dk%k x k are 'invariant'; they map the collection /-dilates of normalized functions to 
itself. 

A key observation, which is used when we consider convolution on homogeneous 
nilpotent groups, is that we can replace the operator 2 dk%k d Xk with the operator 
2 dklk Z k , or conversely the operator 2 dk%h Z k by the operator 2 dklk d Xk , at the cost of 



FLAG KERNELS ON HOMOGENEOUS GROUPS 17 

introducing an error involving terms 2 dl%l Z\ or 2 dlH di where / > k multiplied by a 
'gain' 2~ dk ^ ll ~ %k \ The precise statement is given in Proposition 14.51 below. 

Let P l k £ %di-d k be the homogeneous polynomials that are coefficients of a vector 
field Zk as in equations (13.11) or (13. 2p . and let Q l k £ %di-d k be the polynomials in 
Proposition 13. 21 Since d\ — d k < di, we can use Proposition 14.41 to write 

J^(x)[^Mx) = 2*'(*-*)[^] / (x) ) 

Qi(x)[mx) = 2 iiidi - dh) [QiMM), 

where P l k T , Q l kl £ T-Ldi-d k are normalized relative to P k and Q l k 

Proposition 4.5. Let I £ En, and let {P k j} and {Q l k j} be the homogeneous poly- 
nomials defined in equation ( 4-8 )- Let ip £ C£°(R ) (respectively ip £ <S(R )). Then 

N 

(2 d ^z k Mj = (2 d ^d k M! + ^ dk{H - lk) {2 d ^d l )[P l k ^} I , 

di>d k 
N 

(2 d ^d k M I = (2 d ^Z k M I+ 2- d ^-^\2 d ^Z l )[Q l k ^] I . 

di>d k 

The functions {P k jip} an d {Qi /0} are normalized with respect to ip in Cq°(M. n ) 
(respectively in S(R N ) ). 

Proof. The first identity follows immediately from equation (13. 2p . Proposition I4.4[ 
and equation (14.81) . To obtain the second, use Propositions 13.21 and equation (14. 8p ; 
we have 

N 

2 d ^d x Mi = 2 dklk z k ^}i + 2 dklk E z iQl(*Mi 

l=k+l 

N 

= 2* ki *Z k [ip\ I + Y, ^ il - ik \2 dlil Z l )[Q l k ^} I , 

l=k+l 

which is the desired formula. □ 

Corollary 4.6. If tp £ Cq°(R n ) (respectively tp £ S(R N )) and if I £ E N , then there 
is a function tp £ Cq°(R n ), (respectively tp £ S(M. N )), normalized with respect to tp, 
such that 2 dkik Z k [<p\ I = [<p\ r . 

We shall need an analogue of Proposition 14.51 for r-fold products of vector fields 
or Euclidean derivatives. If {P^ 1 } and {Qu'} are the polynomials appearing in 
Propositions 13.41 and I3.5[ we use Proposition 14.41 to define polynomials PyS and 
Qui I by the formulas 

^(x)M,(x) = 2-^-^)[P^]/(x), 

Q^(x)^]r(x) = 2^^-^)[g-;>] 7 (x). 
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Proposition 4.7. Let ki, . . . , k r G {1, . . . , N} be a set of r integers, possibly with 
repetitions, and let I G E^. Let Z% denote either Xg orYg. Then 

(2 dk i ik iZ kl ) ■ ■ -(2 dk ^Z k 



= Y Y Y ■" Y 2 _E '= iEtec/ i^ (im ' _ifei) 

s=l (U 1: ...,U B )GJr m^fUi) m s £3(U s ) 

(2 d ^d mi ) ■ ■ ■ {2^'d m .)[P^j ■ ■ ■ P%fl>]i, 

and 

(2 dk ^d kl )--{2 d ^d kr Mj 

r 

= Y Y Y ■" Y 2 _E '= iE * 6£/ i^ ( ^ _ifc * ) 

«=i (t/i,...,c/ s )ewj'mie:r(E/i) m a e3(u a ) 

(2 d ^Z mi ) ■ ■ ■ (2*-*™«Z ro .)[Q^ • • • Qu s s ,Mi- 
In either identity, if s = r so that Ui = {ke}, 1 < I < r, the polynomials P™*(x) = 

g£(x) = i. 

Proof. Using Proposition 13.41 we have 

(2 J '.''.a ti )...(2^^a l j[fl J 

s=l U&Al miG3(C/i) m s ea(C/ s ) 

=2 £*.*.^ ££•••£ 

s=i «ew; miea(c/i) m s ea(i/ s ) 
— ^ ^ ^ ... ^ 2<£'i dkeike ~I='i imedUe 

s=l UeUr mi €3(Ui) m s £3(Us) 

(2*-i*»iz mi ) • • • (2 d - 4 -z m j [g^ • • -g^] r 

This completes the proof of the first identity. The second identity is established in 
the same way. □ 

In Proposition HTJ we can rewrite the exponent in the power of 2 as follows. Having 
chosen U = (U\, . . . , U s ) G W s , there is a unique mapping a — au '■ {1, ■ ■ ■ , r} — > 
{1, . . . , s} so that £ G for 1 < I < r. Then 

r s r r r 

dk e ik e ~ im e du e = df.fi f. ( — = ~ ^ ] d ke (i maU) —ik e )- 

i=\ 1=1 1=1 1=1 1=1 

Since £ G t/^ and m a{e) G it follows that d m<r{f) > d u<£) > d kt , with 

equality only possible if U a (i) = Thus if I G En, it follows that i m ,„ > i^, 
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in which case 

where we can take e = d\ > 0. 

We can now recast the identities in Proposition 14. 71 in a way which, although losing 
some information, makes them more useful and easier to work with when dealing with 
flag kernels 

Corollary 4.8. Fix a decomposition R N = R ai © • ■ ■ © R an . Let ki, . . . , k r G 
{1, ...,N} be a set of r integers, possibly with repetitions, with ke G J^Jj For 
1 < I < r, let Z kl denote either X kl or Y kr Let I G S n , and let ip G S(M. N ) or 
C^(R N ). 

(1) The function (2 dfc i v < 1 )Zfc 1 ) ••• (2 dfe <-M>-) Z kr i can be written as a finite sum of 
terms of the following form. Decompose the set {1, . . . , r} into two disjoint com- 
plementary subsets A and B with ir(j) ^ n for any j G A and 5^0. Let 
B = {£i, . . . ,£ s }, and choose integers fa = {ra 1 , . . . ,m s } so that each ra t G Ji t . 
Then a typical term in the expansion of (2 dfe i v (i) Z kl ) • ■ ■ (2 dkrl7r <- r '> Zk r )[ip]i is 

2- e £ je A(M,)+i-M J) ) (2 dm ^wd mi ) ■ ■ ■ (2 dmsl ^d ms )[ijA,B,rh\i 

where i/jA,B,m is normalized relative to ip. 

(2) The function (2 dk i ln '- 1 ')dk 1 ) • ■ ■ (2 dkrln Mdk r )[tp]i can be written as a finite sum of 
terms of the following form. Decompose the set {1, . . . , r} into two disjoint com- 
plementary subsets A and B with 7r(j) ^ n for any j G A and 5^0. Let 
B = . . . ,£ s }, and choose integers fa = {mi, . . . ,ra s } so that each ra t G Ji t . 
Then a typical term in the expansion of (2 dfe iMi)<9 fcl ) ■ • • (2 dk ^( r ) d kr )[i/j]i is 

2-«£i 6 A(Mi>+i-Mi)) (2 d ^wZ mi ) ■ ■ ■ (2 d -^Z ms )[iJ ABifh } I 

where ipA,B,m is normalized relative to tp. 

Remarks 4.9. 

(a) The essential point of part ([T]) in the Corollary is that, when replacing the oper- 
ator 

(2 d ^wZ kl )---(2 d ^(r)Z kr )[iP]j 
with a sum of terms of the form 

2- e £ j6 A(M,)+i-Mi)) (2 d -iMD<9 mi ) • • ■ (2 dm '^')d ma )[^A^,fn]l, 

either the factor (2^^) Z kl ) is replaced by a term (2 dm ^ v w d mi ) where the coor- 
dinate x me belongs to the same subspace as x kl and hence has the same dilation 
or it is replaced by the gain 2~ e ^ { -^+ 1 ~ %iv{ - 1 ^ . Part ([2]) is the same assertion 
with the roles of the vector fields and Euclidean differentiation interchanged. 

6 Recall from page[S]that tt : {1, . . . , N} — > {1, . . . , n}; for any coordinate Xj, 1 < j < N, then Xj 
is a coordinate in the factor R a "U) , and J n (j) is the set of indices of all the coordinates in IR'Mj) . 
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(b) One term in the expansion of (2 dfc i i?r ( 1 )Z/ Cl ) • • • [2 dkrlw ^ Z kr )[ r 4 ) }i in part (CQ) arises 
by letting A = so that B = {1, . . . , r}, and then choosing m r = k r . We have 
seen that in this case the function ipA,B,m = so we § e ^ the term 

{2 d ^wd kl ) ■ ■ ■ (2 dk ^d kr )[i)} 1 . 

Every other term in the expansion then involves either a gain 2~ e ( ln< - e '>+ 1 ~ ln <- e ^ or 
the replacement of a variable x kt by a different variable x me with d mi > d k( . We 
shall say that any term of the form 

2- e £ J6 A(M J )+i-M,)) (2 d ™^md mi ) ■ ■ ■ (2 dm ^d ms )[^ AjB ,rn}i, 

where either A ^ or some d me > d ke is an allowable error. Thus the difference 

(2 d *iMDZ fel ) ■ ■ • (2 d ^M Zkr )[tlj] I - (2 d ^d kl ) ■ ■ ■ (2 d ^(r)d kr )[^] I 

is a sum of allowable errors. 

(c) Since Euclidean derivatives commute, it follows that if a is any permutation of 
the set {1, . . . , r}, then the difference 

( 2 *iMDZ fcl ) • ■ ■ (2 d "^ir) Zkr )[ij}j - (2 d ^Z K(1) ) ■ ■ ■ (2^M')\ w )Mr 

is a sum of allowable errors. 



5. Cancellation 

A function is often said to have cancellation if its average or integral is zero. For 
our purposes, we shall need a more refined notion involving integrals in some subset 
of variables. Let J = {j u . . . , j s } C {1, . . . , N}. If ^ G S{R N ), write 

/ V(x) dxj — / ip(xi, . . . ,x N ) dxj x ■ ■ ■ dx js . (5.1) 

J JR" 

Note that J "0( x ) <^ x j is then a function of the variables x k for which k ^ J. We say 
that a function ip has cancellation in the variables {xj t , . . . , Xj s } if J RS ip{x) cZxj = 
where J = {ji, . . . , j s }. Later in Section 15.21 we shall give additional definitions of 
'strong cancellation' and 'weak cancellation' for a function ip. 

5.1. Cancellation and the existence of primitives. 

We begin by showing that cancellation in certain collections of variables is equiv- 
alent to the existence of appropriate primitives. 

Lemma 5.1. Let ip £ S{R N ), and let J k C {l,...,n} be non-empty subsets for 
1 < k < r. If the sets {Jk} o,re mutually disjoint, then the following two statements 
are equivalent: 

(a) Fori <k <r, / ^(x) dx Jfe = 0. 
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(b) There are functions ipj lt ...j r G S{M. N ), normalized with respect to ip, such that 



21 



" d h--- d iri>h,-,jr(X-)- 

jl€Jl jr&Jr 



Moreover, if the function ip G C, 



Cg°( 



o i 



, then we can choose the functions ipj lt . 



normalized with respect to ip. 



It follows easily from the fundamental theorem of calculus that (jb]) implies (jgj). 
The main content of the Lemma is thus the opposite implication. This will follow 
by induction on r from the following assertion. 

Proposition 5.2. Let ip G C%°(M. N ). Suppose that J±, J 2 C {l,...,iV} are non- 
empty and disjoint. If f ^>(x) dxj x = f i/)(x) <ixj 2 = 0, then for each k G J\ there 
is a function ipk G C^°(IR Ar ) ; normalized relative to ip, so that: 

(i) we can write ip = ^2 k€Jl dkipk', 

(ii) for each k G J\ we still have J ipk(x) c/xj 2 = 0. 

If ip G S(M. N ), the same conclusions hold except that the functions ipk G S(M. N ) and 
are normalized relative to ip. 

Proof. By relabeling the coordinates, we can assume that J\ = {1, . . . , k}, and that 
J 2 C {k + 1, . . . , N}. Suppose that ip has compact support in the set B = {x G 



\X 



j\ < a,j}. Choose x G with support in [—!,+!] such that J R x( s ) ds 



and put Xj{t) — a j x( a j t) so that Xj is support in [— aj, +%] and still has integral 
equal to 1. Put 

(pi(x) = ip(x) - Xi(xi) / ip(s,x 2 ,...,x N )ds 



and for 2 < j < k, put 



[n»(u) 



ip(si, . . . , Sj_i, Xj, . . . , xn) dsi ■ ■ ■ dsj-i 
TxK^) / ip(s 1 ,...,Sj,Xj +1 ,...,x N )ds 1 ---ds j . 



Then the functions {(fij} have the following properties. First, since J ip(x) dxj l = 0, 
the second integral in the definition of the last function tp^ is zero, and hence ip(x) = 
12j=i ^j( x )- Next, it is clear that each <pj is supported in the set B. Finally, for 
l<J<k, 

ipj(xx, . . . , Xj-!, S, Xj+i, . . . , x N ) ds = 0, 
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so if we put 




• • • j x j—li s , x j+l: • • • j x n) ds 



ipj(xi, . . . , Xj-i, s, x j+1 , . . ., x N ) ds, 

then ipj is supported on the set B, and <Pj(x) = djijjj(x). It is clear that one can 
estimate the size of the derivatives of the functions {ipj} in terms of the derivatives of 
ip, so xjjj is normalized in terms of ip. Moreover, since J ip(x) (ixj 2 = 0, it follows from 
their definitions that J </>j(x) dxj 2 = 0, and hence J ipji'x) <ixj 2 = 0. This completes 
the proof if ip £ C^(R N ). If ip £ S(R N ), the proof goes the same way. One only has 
to observe from equation (15.21) that the functions ipj £ S(R N ). This completes the 
proof of Proposition I5.2[ and hence Lemma 15.11 is also established. □ 




5.2. Strong and weak cancellation. 

In this section we introduce two kinds of cancellation conditions that can be im- 
posed on functions in S(R N ) or Cq?(R n ). As discussed in Section [TTT1 these concepts 
are used in the context of the decomposition WL N = M. ai ©• • -©IR"" given in (12. 3p where 
cti + • • • + a n = N and each aj > 1. Recall that if x £ M. N , we write x = (x 1; . . . , x n ) 
where x; = (x Pl ,x pl+ i, . . . ,x qi ) £ lR ai . We then let J; denote the set of integers 
{ Pl , Pl + l,..., qi }. If <peS(R N ), set 

/ <^(xi, . . . ,x n ) d*{ = / (p(x 1 ,...,x n )dx pt ---dx qt . (5.3) 



Definition 5.3. Fix the decomposition li2.^J\) . and let ip £ <S(R JV ). The function (p 
has strong cancellation if and only if 

/ y?(xi, . . . ,x n ) dx e = 0, 1 < i < n. 

JR a l 

That is, (f has strong cancellation if and only if it has cancellation in each collection 
of variables {x Pl , . . . , x qi } for 1 < I < n. 

Remark 5.4. It follows from Proposition 15.21 that <p has strong cancellation if and 
only if there are functions (Pj x j n normalized with respect to ip so that 

We now introduce a weaker cancellation condition. 

Definition 5.5. Fix the decomposition / TO) ofR N . Let ip £ S(R N ) or ip £ C^(R N ), 
and let I = (ii, ■ ■ - i n ) £n- The function ip has weak cancellation with parameter 
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e > relative to the multi-index I if and only if 

*= E ( II 2_e(is+1 " is a*-) [^,B, h ,.., 3r \ 

AuB={l,...,n} s£B jieJ ai j r eJ ar 

A={ai,...,a r } 
n£B 

where each pA.B.j 1 ,...,j r is normalized relative to p. Here the outer sum is taken over 
all decompositions of the set {1, . . . ,n} into two disjoint subsets A and B such that 
n G A. 

According to Remark 15.44 <p has strong cancellation if it can be written as a sum 
of functions of the form dj 1 ■ ■ ■ dj n tp; i. e. as n^-derivatives of functions where there is 
one derivative in a variable from each of the n subspaces lR a * . Definition 15.51 imposes 
a weaker condition; a function tp has weak cancellation if again it is a sum of terms, 
but the term dj 1 ■ ■ ■ dj n tpj lr __j n is itself replaced by a new sum of terms. If a derivative 
dj l does not appear, so that the term does not have cancellation in the subspace R fl! , 
there is instead a gain given by 2~ e ^ l+1 ~ ll \ 

Remarks 5.6. There will be occasions when we will use the fact that a function 
tp has weak cancellation with respect to / G S n to draw inferences about existence 
of primitives and smallness in only some of the subspaces {IR ai }. In particular, the 
following assertions follow easily from Definition 15.51 

(1) Suppose that I G £ n and that <p has weak cancellation relative to /. Let M C 
{1, . . . , n}. Then 

E ( II 2- ( — } ) 9 js ) [VA,B, n ,.., jr ] 

AUB=M SGB jieJ ai jr&Jar 

A={a 1 ,...,a r } 

ngB 

where each (pA,Bj lt ...,j r is normalized relative to ip, and where the outer sum is 
over all subsets B C M, with the understanding that if n G M then n ^ B. 

(2) In particular, if we take M = {1}, it follows that if ip has weak cancellation 
relative to I G S n , we can write 

Ol 

p = Y,dr[Pr]+^-^p Q 
r=l 

where {tpo,tpi, . . . , <p ai } are normalized relative to p. Here, the derivatives are 
with respect to the variables in the first subspace 

We can also characterize weak cancellation in terms of the "smallness" of integrals, 
to be compared with the characterization of strong cancellation in terms of the 
vanishing of integrals given in Lemma 15.11 For any partition {1, . . . ,n} = AU B 
with A = {ji, . . . ,j a } and B = {hi, . . . , kb}, write x G WL N as x = (x^, x B ) where 
x^ = (xjj, . . . , Xjj), x B = (x fcl , . . . , x fc J. Let dx.A = dx.^ ■ ■ ■ dx.j p and let dx B = 
dx kl ■ ■ -dx kb . 
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Proposition 5.7. Let e > and I £ S n . A function ip £ Q' (1R JV ) /ias weafc cancella- 
tion with parameter e relative to I if and only if for every partition {1, . . . ,n} = AUB 
into disjoint subsets with n ^ B , we have 

where ipA £ ^(©jg^R^) is normalized relative to ip. IfipE Cq°(R jv ) ; £/ie functions 

^ec »(e i6 ,ri). 

Proof. It is clear that if ^ has weak cancellation, then it satisfies the condition of 
Proposition 15.71 so the main content is the opposite implication. 

Let xi £ Qj°(lR ai ) have support in the set where |xj| < 1, with J Rai x( x c^Q = 1- 
For V £ «S(R JV ) and 1 < / < JV, define 

Li [-0] (x) = V (x) - (xi) / ^ (x) dxj , 

M,[V](x)=Xi(x l ) / V'(x)chQ. 

It is easy to check that the operators {L 1; . . . , L n , M 1; . . . , M n } all commute, and 
that 



M^](x)dx,= / ^(x)rfxj, 



LzM(x) cbq = 0. 



Since L\ + is the identity operator, if x/j £ S 

n 

^) = Y[(L l + M l M^) 



i=i 



AuB={l,...,n} j£A keB AuB={l,...,n} 

It is clear that the functions {^a,b} are normalized relative to ip. Note that 
(T\M k )[i>]^)= (T[xk(xk)) [ ^(xA,x B )dx B 



and for every j £ A we have 



/ 



■0a,b(x) dxj = 0. 



Now suppose that ^ satisfies the hypotheses of Proposition 15.71 For every decom- 
position {1, . . . , n} = A U B, it follows that 

(n W)m(x) = [n2^ +i - !t) ]^(x.)(n»(x fc 

fees fees fce-B 
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where ipA is normalized relative to ip. But then 

^,b(x)= (n^)(n Mfc )^w 

j&A keB 

= [[12-^] (ii l [ ^ ]{xa) (n ■ 

fceB jGA fcGB 

Now the function ^ YljeA ^i) H>a] ( x a) ( YlkeB Xfc( x fc) J has cancellation in each of the 
subspaces W 1 ' for j £ A. If we write A = {ai, . . . a r }, it follows from Lemma [5.11 
that we can write this function as a sum of terms of the form dj 1 ■ ■ ■ dj r [ipA,B,a] where 
the variable Xj k belongs to the subspace M. aa k . This then makes it clear that ip has 
weak cancellation, and completes the proof. □ 

6. The structure of flag kernels 

In this section we establish four important properties of flag kernels. The first 
(Theorem 16. 1 p shows how to decompose a given flag kernel into a sum of dilates of 
compactly supported functions with strong cancellation. The second (Theorem 16.81) 
shows conversely that a sum of dilates of Schwartz functions with weak cancellation 
converges to a flag kernel. The third (Theorem 16. 13[) shows that the cancellation 
conditions imposed in part (jb]) of Definition 12. 31 can be relaxed. The fourth (Theorem 
I6.15P shows that the family of flag kernels is invariant under appropriately chosen 
changes of variables. 

6.1. Dyadic decomposition of flag kernels. 

Every flag kernel is a product kernel in the sense of Definition 2.1.1 of [NRSOlJ, 
and Corollary 2.2.2 in that reference implies that if /C is a flag kernel, then there 
are functions ip £ Cq°(R ), / £ Z n with strong cancellation and uniformly bounded 
seminorms | |<£ J | |( m ), supported in the set where | < Nj(xj) < 4 for every j, such that 
/C = Yliez^VP 1 ]! m t ne sense of distributions. Recall that £ n = {I = (z 1; . . .,i n ) £ 
Z n | i\ < %2 < • • • < i n }- Since a flag kernel satisfies better differential inequalities 
than a general product kernel, one expects that it should be possible to write /C as 
a sum of dilates of functions if 1 £ C^°(lR Ar ) where the dilations range only over the 
set £ n instead of over all of Z n . Such a result is stated in Corollary 2.2.4 of |NRS01j . 
but the precise statement there is not correct because one must allow additional 
terms involving flag kernels adapted to coarser flags. This section provides a correct 
statement and proof. 

Theorem 6.1. Let JC be a flag kernel adapted to the standard flag T 

(0) C lR a " C IR"™- 1 © lR a " C • • • C IR" 3 © • • • © lR a ™ C IR" 2 © • • • © lR a ™ C R N fTZ4l) 

Then there is a decomposition 1C = /Co + fC\ + • • • + fC n with the following properties. 

(1) For each I £ £ n there is a function ip 1 £ Cq°(WL n ) so that /Co = J2ies n l 1 ^ 1 ] 1 w ^ 
convergence in the sense of distributions. Moreover: 
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(a) the support of each function ip 1 is contained in the unit ball B = {x £ 
R N | |x| < 1}; 

(b) there are constants C m > ( depending on the constants for the flag kernel 
K) so that for each I £ S n and all m>0, W^Wim) < C m ; 

(c) each function (p 1 has strong cancellation in the sense of Definition 15.31 ' for 
1 < j < n 

/ y?(xi, . . . ,Xj, . . . ,x n ) dxj = 0; 

(d) for each I £ £ nj </? 7 (xi, . . . ,x n ) = for |xi| < g. 

(2) For 1 < j < n, each Kj is a flag kernel adapted to a flag which is strictly coarser 
than J- ' . 

Recall that in Section IL~2l we distinguished between elements I = . . . , i n ) £ S n 
where we have strict inequality %\ < %i < ■ ■ ■ < i n ~\ < i n , and elements where 
some of the entries are equal. Underlying this dichotomy is the fact that S n is a 
polyhedral cone in Z n , the set of elements with strict inequality form the 'open' 
n-dimensional interior, and elements with various sets of equal elements correspond 
to lower dimensional faces. In Theorem 16. 1[ we may as well assume that the dyadic 
sum representing /Co extends over the interior n-tuples with strictly increasing com- 
ponents, leaving the "boundary n-tuples" with repeated indices to parametrize the 
dyadic terms of the sums representing the other /C,-'s. 

Using induction on the number of steps in a flag, Theorem 16.11 immediately gives 
us the following corollary. 

Corollary 6.2. Let K be a flag kernel adapted to a standard flag J 7 of length n. 
There is a finite collection of flags {J 7 ^}, 1 < k < n and 1 < s < b^, with the 
following properties. 

(1) For k — n, b n — 1 and T n> \ = T . 

(2) For each k < n, the flag J-k tS has length k and is strictly coarser than T ' . 

(3) For each (k, s) there is a uniformly bounded family of functions {ip{ s } C C™(M. N ), 
J £ £k, all supported in the unit ball and having strong cancellation relative to 
the decomposition of M. N corresponding to the flag Fk >s so that in the sense of 
distributions, 

n b k 

k=i s=i JeE k 

The proof of Theorem 16.11 relies on three preliminary results. The first, which 
gives the characterization of flag kernels in terms of their Fourier transforms, was 
established in |NRS01] . We briefly recall the relevant definitions. Let (R N )* denote 
the space of linear functionals on R , and for a subspace W C R , let W C (M^)* 
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be the subspace of linear functions which are zero on W. If J 7 is the flag in M. N given 
in (ED, the dual flag f G (R N )* is J 7 * given by 

(0) ± D (M a ") ± D (M "- 1 © K a ™) ± D • • • D (R° 2 © • • • © M a ") ± = (R JV ) ± . 

If we identify (R afe © • • ■ © lR a ") ± with (R ai )* © • ■ ■ © (R**- 1 )*, the dual flag becomes 

(o) c (R ai y c (R ai y © (M a2 )* c • ■ ■ c (R ai y © • ■ ■ © (ir™- 1 )* c (r^)*. (6.2) 

If e e (R^)*, we write f = (£i,...,£„) where & = (£i,i> • ■ ■ <= (^)*- The 

family of dilations on R N defined in equation (12. ip induce a family of dilations on 
(R^)* so that (r • x, £) = (x, r • £). We let |£| be a smooth homogeneous norm on 
(R )*, and if £j G (R° J )*, we let |^ | be the restriction of the norm to this subspace. 

Definition 6.3. A flag multiplier relative to the flag J 7 * given in (6.2) is a function 
m(£) which is infinitely differentiable away from the subspace = 0, and which 
satisfies the differential inequalities 

n 

\ d S ■ ■ ■ < c a + ■■■ + \ar 1 ^. 

We can now state Theorem 2.3.9 of |NRS01j as follows: 

Lemma 6.4. Let K be a flag kernel adapted to the flag J 7 . Then the Fourier trans- 
form of K, is a flag multiplier relative to the dual flag J 7 * . Conversely, every flag 
multiplier relative to the flag J 7 * is the Fourier transform of a flag kernel adapted to 
the flag J 7 . 

The next preliminary result provides a decomposition of test functions in <S(R ). 

Lemma 6.5. Let ip G «S(R JV ). Then there are functions {4> k } C C™(R N ), k = 
0, 1, 2, . . . such that 

oo 

V>(x) = ^2- fcQ ^(2~ fe -x), 

where Q is the homogeneous dimension of R N . Moreover these functions have the 
following properties. 

(a) Each ifj h is supported in the unit ball; 

(b) For any 5 > and any a G N N , there exists M G N depending on 5 and a so 
that 

sup |dV(x)|<|H| M 2- fe5 ; 

(c) If ip has strong cancellation, then each ip k has strong cancellation. 

Proof. Choose r] G C£°(R) supported in [—1, — |] U [|, 1] with r)(t) = r)(—t) such that 

+oo 
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for all t ^ 0. For any t G E, including t = 0, set r) (t) = 1 - T,kLiV( 2 ~ k t)- Then 
r/o G Cg°(R) is supported in [-1,+1], and r) (t) + J2T=iV(2~ k t) = 1 for all t G R. 
Recall that x — > |x| is a smooth homogeneous norm on WL N . Set 

f^(x)r7o(|x|) if fc = 0, 

\2 kQ i)(2 k •x)r/(|x|) if A; > 1. 

Then 2" fcc V c (2- fe -x) = ^(x)r/(2- fc |x|) for jfe > 1, and so ^(x) = J2T=o 2~ kQ ^ k ( 2 ~ k -x)- 
From the choice of i] and the definition of 770 it follows that each ijj k is supported on 
the set where |x| < 1, and this gives the assertion (|aj). 

Since ip G S(M. N ), it follows that for every MgN and every a G with \a\ < M 
we have 

|9Xx)|<||^|| [M] (l + |x| e )- M , (6.3) 

where |x| e is the Euclidean length of the vector x G M. N . If x belongs to the support 
of ijj k when k > 1, we have |x| > 2~ 3 , and so \2 h ■ x| > 2 h ~ 3 . Assertion (jb]) then 
follows easily from equation (16.31) and the fact that |x| dl < |x| e if |x| e > 1. 

Finally, suppose that ^ has strong cancellation, so that f Rak t/>(xi, . . . , x n ) dx^ = 
for 1 < fc < n. It follows from Lemma 15.11 that we can write if) as a finite sum of 
terms of the form ■ ■ ■ dj n ipj lt ...j n where coordinate in x^ for 1 < k < n, 

and each function ipj lt ....j n G <S(R ) is normalized relative to ip. Using assertions (jaj) 
and (jb]), we can write 

00 

k=0 

and so 

CO 

• • • ^^...^(x) = 2^+-+^+Q) dn ■ ■ ■ d jn iPl_ jn (2- k ■ x). 

fc=0 

Since each term 2 _fc ( d Ji H ^-"^c^ • • • dj^^ - n has strong cancellation, summing over 
a finite number of such terms establishes assertion (jcj), and completes the proof. □ 

Finally, we will need the following result which provides a decomposition of test 
functions in Cq°(R n ), 

Lemma 6.6. Let ip G Cq°(M ) have compact support in the unit ball, and suppose 
that if has cancellation in xi/ i.e. L 01 <£>(xi, X2, . . . , x n ) dxi = 0. Then there are 
functions {(pty C Qf^IR^) swc/i t/iat 



V?(xi,x 2 , . . . ,x n ) = 2^ 2~ kQx Lp s {X 3 ■ xi,x 2 , . . . ,x„), 

j=-oo 

where Q\ is the homogeneous dimension of IR ai . Moreover, these functions have the 
following properties. 
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(a) Each ^ is supported in the unit ball; 

(b) Each ipi is normalized relative to the function tp; 

(c) For —oo < j < we have v9 J (xi,x 2 , . . . ,x n ) = if |xi| < |; 

(d) If ip has strong cancellation, then each function ^ has strong cancellation. 

Proof. Choose t] e such that rj(t) vanishes if t > 2 or t < |, and such that 

J2i <0 v( 2 ' lt ) = 1 for < i < 1. Put Ai(x 2 , . . . , x n ) = 0, and for j < 0, put 



A*(x 2 , 



Xi(xi) 


= i?(2^| 


xi|) 


-1 


Xi(xi) 


= Xj(xi) 




• • j x n) 


= / <^( x i> x 2, • • • ,x„)x i (x 1 ) dx 


• • j x n) 





X 2 , • • • , x n) • 





1- 



We have X]j<o X?( x i) = 1 for < |xi| < 1, and since ip is supported in the unit ball, 
we can sum by parts for x x ^ to get 

^( x ) = ^ Xl ' " ' ' x n)Xj( x i) 

i<o 

= Yl b( x i'---' x «)Xi( x i) -Xj( x i)ai( x 2,---, x n) 

i<o 

+ Yl fe( X l) - Xj-l( x l)Mj( x 2, 

i<o 



Z^ 1 



X . 



Let 
Then 



2^,(2^ x l5 x 2 



, . . . , j\. n j 



y^fxj 



^(2^ ■ xi, . . . ,x n )xo( x i) - 2 jQl Xo( x iK-( x 2 , 
+ 2^ (xo( x i) - 2~ Ql xo(2 • x i))^(x 2 , 



and 



(6.4) 



(6.5) 



j=-oo 



The functions {a,} and hence also the functions {Aj} are infinitely different iable 
functions supported in the unit ball of lR a2 © • • ■ © R a ™ . Moreover, it follows from 
the fact that Xj( x i) is supported on the set |xi| < 2 J+1 that there is a constant C so 
that for each integer m, we have 



a 



j 1 1 (m) 



< C 2 jQl I 



(6.6) 
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The function Yll=-oo Xs( x i) is also supported on the set |xi| < 2 J+1 , and is bounded 
independently of j. We have 

Aj(x 2 , . . . ,x„) + / (^(xi) V" Xs(xi)dxi= / <£>(xi,x 2) • • • ,x n ) rfxi = 0, 

and it thus follows that we also have 

\\A 3 \\ {m ) <C2 jQl \\(p\\ {m) . (6.7) 

It is clear from our construction that each function (pi has compact support in the 
unit ball. It follows from equation (16. 4p that each (pi vanishes when |xi| < |, and also 
that if (p has strong cancellation, then J Rafe </2 J '(xi, x 2 , . . . , x„) dx^. = for 1 < k < n. 
Finally, equations (IQjl . flfTHjl . and (EZD show that ||<^||( m ) < C 2~ J ' Ql | \tp\ | (m) . This 
completes the proof. □ 



Let /C be a flag kernel adapted to the flag J 7 , and let m — K, be the flag multiplier 
on the flag J 7 * which is the Fourier transform of /C. Choose a function 77 e Cq°(R) 
supported in [|, 4] such that X^'ez Vi^t) = 1 for all t > 0. For each / = . . . , z n ) G 

. set 

Vl (0=v(2 li m---v(2 ln \U)- (6-8) 
Note that rjj is supported where ~ 2~* : ' for 1 < j < n. We establish 

Lemma 6.7. Lei m be a flag multiplier relative to the flag T* given in ( QQ| ). T/ien 

le£„ fe=l 

where m i/ie Fourier transform of a flag kernel relative to the flag T ' , and for 
1 < k < n, the function is the Fourier transform of a flag kernel adapted to a 
flag strictly coarser than J- '. 

Proof. Let 9 be a smooth function on IR supported where t > 10 such that 9{t) = 1 
for t > 20. Write 

m(£) = m{£)6 (|e re -i| + [l - 9 (\C n -x\ \tn\~ 1 ) ] = m(£) + 

On the support of 9' (|£n-i| Knl^ 1 ) we have |£ n -i| ~ |6n|- Also, by homogeneity we 
have 

IC'd^-iDl ^ C'l^-il 1 - 10 - 11 and < C|^r- W . 

Thus Lemma [6~4l implies that and mi(() are flag multipliers relative to the flag 
(I6.2p . On the support of m! we have |£ n -i| l^nl" 1 < 20. Thus we can group together 
the variables £ n and £„_i, and it follows that mi(() is a flag kernel relative to a flag 
coarser than J 7 *. Also > 10|£„| on the support of n\. 

Next write 

ni(0 = (lCn- 2 | la-il" 1 ) + rn(0 [l - (|£ n _ 2 | l^-il" 1 ) ] = MO + m 2 (0- 
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Since 6>'(|£ n _ 2 | x ) is supported where |£„_ 2 | ~ and \^ x \ > 10|f n |, it 

again follows that n 2 and m 2 are Fourier transforms of flag kernels relative to the 
flag J 7 *. On the support of m 2 we have |£ n -2| < 20 and |£ n -i| |£n| _1 < 20. 

Thus we can group together the variables £ n ,£ n _i and £ n _ 2 , and it follows that 
m 2 (£) is a flag kernel relative to a flag coarser than J 7 *. Also |£ n -i| > 10|£ n -i|, and 
|£n-i| > 10|Cn| on the support of n 2 . 
We proceed inductively to see that 

k=l 

where each m s (£ ) is the Fourier transform of a coarser flag kernel and m (£) = n n (£) 
is supported where |^| > 10|£j + i| for 1 < j < n — 1. 
From our choice of 77, it follows that 

1= v{2 n \Zi\)---v{2 Jn \U) 
= J2v{2 n \Zi\)---v{V n \U)+ E ^7 (S^Kil) - ■ -/7 (2^|^l) 

J£S„ J& n -E n 

Thus if Ej G £„ »7 ( 2il l6l) • ■ -V (2 in |^n|) ^ 1, there is an n-tuple J = (j u . . . ,j n ) such 
and integers 1 < r < s < n such that j r > j s and 77 (2 5 ' r |£ 7 .|) 77 (2 Ja |£ s |) ^ 0. Since 
77 is supported on [|,4], it follows that |£ r | < 4 2^ v < 4 2~ Js < 8|£ s |. However, 
on the support of m we have |£ r | > 10|£ s |. Thus on the support of m we have 
l = E./ e .^(2 J1 |6l)--^(2 J "|a|),and so 

n 

m(0 = E mo(0»7 (2 J1 |6l) ■ (^Ikl) + E m ^' 

J€£„ fc=l 

which completes the proof. □ 

We now turn to the proof of Theorem 16.11 If we write JCj as the inverse Fourier 
transform of the flag multiplier mj of Lemma 16.71 we have shown that K, = JCq + 
YTj=i K>ji and for 1 < j < n, JCj is a flag kernel adapted to a flag which is coarser 
than T . Also since m (£) = J2jee n m o (0^/(0 5 we can write 

Je£„ 

where 

= m (2"- • Ci, • • • , 2"*» ■ ^Ml&l) ■■■Vdtnl), 
and the sum converges in the sense of distributions. The differential inequalities for 
mo imply that each function ty 1 G S(M N ), with Schwartz norms uniformly bounded 
in I. Also since vanishes on the coordinate axes, for 1 < k < n we have 

* J ( Xl ,...,x„) dx k = 0. (6.11) 
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In order to complete the proof of Theorem 16.14 it remains to show that we can 
replace the Schwartz functions ty 1 in equation f)6.10p by functions in C™(M. N ), all 
supported in the unit ball with strong cancellation, and which vanish when |xi| < e. 
This is done in two steps, using Lemmas 16.51 and 16.61 

First, according to Lemma [6 .5[ for each I G S n there are functions ip k)I G Cq°{R n ) 
each supported in the unit ball and having strong cancellation, so that ^> 7 (x) = 
Yl'kLo 2~ h ® ip k,I (2 ~ k ■ x), and for every A > and every positive integer m there is 
an integer p m so that 

||^|| (m) <2- fcA ||^|| [m+Pm] . 

Thus [V> J ]i(x) = ^ =0 2- Q ^ k+i ^~"- Qn( - k+in ^ k ' I (2~^+^ ■xi,...,2-( < » + *) ■ x„), and 
so formally 

oo 

[^ 7 ]i(x) = J2J2 2- Ql ( fc+il )-- Q "( fc+i ")^ fc ' / (2- (il+fc) • Xi, . . . , 2-(** + *> ■ x„) 
ie£ n k=o ies n 

00 



2- Ql ^-'— Qn ^iff k ' I (2- jl ■ xi, . . . , 2~ jn ■ x n ) 

k=0 J&£n 

oo 

--0»(jn) J^^vj ( 2 -ii . Xl , . . . ,2~ j - ■ x n ) 



Jes„ k=o 

oo 



EE*" 



Je£ n k=o 



The estimates we have on the functions show that the series 57J^L V^' 7 = tf 1 

converges in C™(M. N ) to a function in supported in the unit ball which has strong 
cancellation. This formal calculation is easily justified by applying it to finite sums 
of dilates. Thus we have shown JCq = ^TJ/ef with convergence in the sense of 
distributions, where the functions tp 1 G C™(M. N ) all have support in the unit ball, 
have strong cancellation, and are uniformly bounded in each semi-norm 1 1 ■ \\( m ). 

Finally, Lemma [6761 shows that for each I G S n , there exist functions ^ G Cq°{R n ) 
with strong cancellation, each supported in the unit ball, normalized relative to (f 1 , 
and vanishing when |xi| < | so that <£> 7 (xi, x 2 , . . . , x n ) = Yl°j=~oo 2 -J ^ 1 <^ 7 ' 7 (2~- 7 ■ 
xi, x 2 , . . . , x n ). We then have 



[v J ] jr (xi,x 2 , • • . ,x n ) 


= 2^ +h)Ql - i2Q ^-~ inQn ^' I (2^ j+ll) ■ xi, 2" 12 • x 2 , . . . , 2 _< " • x n ), 



J = -CO 

and so since we are summing only over non-positive indices j, we have 


[^ 7 ]/(x) = J2 Yl 2~ u+h)Ql - l2Q2 -- inQ "<p j ' I (2- {]+h) • xi, . . . , 2" 

IeS n j=—oo l££ n 
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Y 2- ilQl - i2Q2 —- inQn ip j ' 1 (2~ h • xi, . . . , T in ■ x n ) 

j=-oo IeS n 



2- < i0i- < a0a-—inQn J ^ (2^i . xi, . . . , ■ X, 



ie£„ j=—oo 





E I E ^ 



The estimates we have on the functions show that the series X^=-oo = 

converges in C™(M. N ) to a function in supported in the unit ball which has strong 
cancellation, and which vanishes when |xi| < |. This at last completes the proof of 
Theorem 16.11 

6.2. Dyadic sums with weak cancellation. 

The main result of this section, Theorem 16.81 is a strengthening of the converse 
to Theorem 16.11 which would assert that sums of dilates of appropriate compactly 
supported bump functions with strong cancellation are flag kernels; we consider 
sums of Schwartz functions instead of compactly supported functions, and more 
critically, we assume only weak cancellation instead of strong cancellation relative to 
the decomposition R N = R ai © ■ • • © R a " . 

Theorem 6.8. For each I G S n , let ip G <S(R ), and suppose 

(a) there are constants Cm > so that ||<^ j ||[at] < Cm for each I G S n and each 
N > 0; 

(b) there is a constant e > so that each ip 1 has weak cancellation with respect to I 
with parameter e relative to the decomposition R N = R 0-1 © • • • © R a ™ . 

Then we have the following conclusions. 

(1) For any finite set F C E n , the function Kp = ^2j e p[<fi I }i G S(R N ) defines a flag 
kernel /Cp for the flag T with bounds which are independent of the set F. 

(2) Let Fi C F 2 C ■ • ■ C F m C ■ • • be any increasing sequence of finite subsets of E N 
with En = (Jm=i F m - Then for any test function if) G <S(R ), 

lim (^> m ,^) = lim V / [y? 7 ]/(x)^(x) dx 



exists and defines a flag kernel K, G <S'(R ) which is independent of the choice 
of the finite subsets. We write this limit as JC = lim 

F/E N i ^ 

Since the proof of this result is somewhat involved, let us indicate the main steps. 
If F C E n is any finite set, let Kp = Xl/eF^]/- We show (in Proposition 16. 9p 
that Kp = Y2ie£ [^Ii satisfies the correct size estimates, and also (in Proposition 
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16. lip that Kp satisfies the cancellation conditions with constants independent of F of 
Definition 12.31 This will establish part ([T]) of Theorem 16.81 To establish the existence 
of the limit in part (j2J), we use the weak cancellation of the functions {f 1 } to show 
that for any test function if), the bracket (JCp,ip) can be rewritten as the integral of 
ip and its derivatives against locally integrable functions. The existence of the limit 
then follows from the Lebesgue dominated convergence theorem. 

We now turn to the proofs. Note that in the next Proposition we impose no 
cancellation conditions on the functions {^p 1 }- 

Proposition 6.9. For each I £ S n let tp 1 £ S(M. N ), and suppose there are constants 
Cm so that for all I £ S n and all M , ||9? 7 ||[m] < Cm. Let F C E n be a finite subset, 
and let Kp(x) = X^/eF^M*) ■ F° r an V a = (^i> • • • 5 ®n) £ X • • • X N a " with 
| or] < M, t/iere a constant Am independent of the finite set F so that 

n 

|fl£/G.(x)| < A M JJ [iV^xO + iV 2 (x 2 ) + ■ ■ ■ + • 

Proof. It follows from Proposition 14.31 that 

|^iT F (x)| = I J22-^= lij{Qj+lScjl) d^[ip T ](2- h • xi, . . . ,2~ l " • x„ 

Since y2 7 £ <S(IR n ), for any M we have the estimate 

N 

|^[/](2- il x 1 , . . . ,2-*"x„)| < C M (1 + £V*jV fc (x fc )) 



-M 

z -i\ k [pc k )) 

k=l 



Thus 



\9*K F (x)\ < Cm 2-^*«* + ™>(l + f^2^iV fe (x fe ))- M 
leF k=\ 
Proposition 16.91 thus follows from estimate (lll.2p in Proposition lll.il □ 

The next result provides estimates that will be used in establishing the cancellation 
conditions (part (jBJ) of Definition I2.3P for finite sums Kp = Xl/eF^ 7 ]/- R- ecau that 
these cancellation conditions involve integrals of the form 



K(pci, . . . , x. n )ip{Ri ■ x mi , . . . ,Rp ■ x m ) dx mi ■ ■ ■ dx r 



where . . . , mp} is a non-empty subset of {1, ... , n}. Let M = {mi, . . . , mp}, and 

let L = {/i, . . . ,l a } = {1, . . .,n}\M. Let N a = a h -\ \-a ia and Np = a mi -\ Va mp 

so that N a + Np = N. For x = (a 

v Q 



x , x ) with 




(x mi ,...,x m/3 ) Gffi^ 
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For I = (zi, . . . , i n ) G £ n , write 

/=(/',!») where U' = ^---^)^ a 

[I = {im-L, • • • > *m a ) G 

If i? = . . . , Rp) is a /3-tuple of positive real numbers, write 

R • x' = • x mi , . . . ,Rp ■ x m;9 ), 

2 7 "i? • x" = (2*™i i?! • x mi , . . . , • x ms ). 

Finally, let P(M) denote the set of all partitions of the set M = {mi, . . . , mp} into 
two disjoint (possibly empty) subsets A and £>. 

Proposition 6.10. With the above notation, let I = (ii, . . . ,i n ) G £ n , and let tp G 
S{R N ). Let e (R N f>) so that if) can be regarded as a function of the variables 
x" = (x mi , . . . , x m „). Let R = (R\, . . . , Rp) be a (3 -tuple of positive real numbers. 
Define $ on M. Na by setting 

$(x') = / [</?]j(x',x") ip(R 1 • x TOl , . . . • x OT ) dx mi • • -dxm . 
Jr n p 

(1) There exists G 5(lR Aro! ), normalized relative to tp and ip (with constants inde- 
pendent of (Ri, . . . , Rp) ) such that $(x') = [0]j/(x'). 

(2) If ip has weak cancellation with respect to I, there are constants C and e indepen- 
dent of R = (Rx, . . . , Rp) so that 6 = J2(a B)eP(M) ®a,b where for each partition 
M = AUB, 

|0Ab( x ')I<C [ 2- e{im -+ 1 - im - ) [ mm{(R s 2 im °) +t ,(R s 2 im °)- e }. 

m r £A m a (LB 

Thus for each partition M = A U B of the set of variables {x mi , . . . ,x m/3 } ; 
|0a,_b( x ')I i s small due to two kinds of gains: there is a gain 2~ e ^ mr+1 ~ tmr ^ for 
every index m r G A, and there is a gain min {(i? r 2 lmr ) +e , (i? r 2* mr )~ <: } for every 
index m r G B. 

Proof. Make the change of variables 

x" = fx x h-s- (2 im i ■ x 2 im P ■ x "1 

Then 
$fx') 



2 - E?=i f ¥ ,(2~ il x 1 , . . . , 2-*» • x n )^(i?i -V i^-^) dx mi ■ ■ • dx, r 

/ [tp] /; (x', x")^(i?i2 im i ■ x mi , . . . , i?^2 lm /3 ■ x m0 ) dx mi ■ ■ • dx m ^ 



FLAG KERNELS ON HOMOGENEOUS GROUPS 36 

where 

6(x')= / ^(x',x")V(i2i2 imi ■x mi ,...,R p 2 im e • x m/3 ) dx mi • ■ ■ dx m ^ . (6.12) 
Now 

(1 + \x'\) M d2,e{x') = [ (1 + |x'|) M d>(x', x") ^(2 r 'R • x") dx", 

and we can estimate the iS(M Wa )-seminorms of in terms of those of tp and the 
supremum of independent of the choice of R. This establishes assertion ([!]). 

To prove the decomposition and additional size estimates for G asserted in part 
(j2J), assume that tp has weak cancellation relative to /. Since the definiton (16.121) of 
G involves integration with respect to the variables {x mi , . . . , x m/3 }, we will only use 
the weak cancellation in these variables. We can use the first of the Remarks 15.61 to 
write tp as a sum of terms of the form 

(n 2 " e( " +i " 43) )(n^))[^] 

where {mi, . . . , m^} = A U B with A H B = and n G B if n G {mi, . . . , m^}, 
a" : S — > {1, . . . , N} so that cr(m^) G J m£ , and each tp~A,B,a normalized relative to tp. 

If we write B = {m^, . . . , m^ s }, it follows that Q(x') is a finite sum of terms of 
the form 



I (A; B- a) = J] 2" e ^+ 1 ^) / d a(mti) ■ ■ ■ d a(mis) [p A]B]a ] 



^{R^ • x mi , . . . , R p 2 tm i> ■ x m/3 ) dx mi • ■ • dx m/r 

We can use integration by parts to move the differentiations d u ^ m ^ y ■ ■ do(m, la ) from 
¥A;B;a to Differentiating the function ■?/'(i? 1 2 im i ■ • • x mi , . . . , Rp2 lm P ■ x m/3 ) with re- 
spect to the variable x a ( mi ) with a(mi k ) G J m< , brings out a factor {R mi 2^ m ^ ^ ^ mi k\ 



and so we have the estimate 



sGA rS-B 



On the other hand, without integrating by parts, since ip has compact support, the 
integral in the variables {x m , r | r G B} is taken over the set where |x mr | < (R r 2 tmr )® mr 
for r £ B, and this set has volume bounded by a constant times Hre,B(-^» - 2* mr ) _ ^ mr . 
It follows that there exists e > so that 

\l( Ju ...,3p;A;a)\<l[2-^^ J] min {(R r 2^)+^ (i^)" 6 }. (6.13) 
This completes the proof. □ 
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We now show that if the functions {tp 1 } have weak cancellation, then the sum 
Kp{x) satisfies the cancellation condition (condition (jEJ)) of Definition 12.31 We 
use the same notation as in Proposition 16.101 Thus L = {Zi, ...,/<*} an d M = 
{mi, ■ ■ ■ jiTip} are complementary subsets of {1, 2, ... , n}, and we set N a = + 
■ ■ ■ + ai a and N/3 = a mi + • • • + a m . If x = (x 1; . . . , x n ) G WL N , we write x = (x', x"), 
with x' = (x h , . . . ,X;J and x" = (x mi , . . . ,x m/ J. If I = (i u ...,i n ) G £ n , write 
V = (i h ,. . .,i ia ) G E a and J" = (i mi , . . .,i mp ) G Ep 

Proposition 6.11. For each I G £ n let if 1 G S{R N ), and suppose there are constants 
Cm so that that for all I G £ n and all M, ||v? 7 ||[m] — Cm- Let F C E n be a finite 
subset, and let Kp{x) = Xl/eF^M*)- Let ijj G (R^J be a bump function in 
the variables x" = (x mi , . . . , x m/3 ). Let R = (Ri, . . . , Rp) be a (3 -tuple of positive real 
numbers, and let 7 = (7^, . . . , 7^) G N a! i © ■ ■ • © N a ' a . There exists a constant C^, 
independent of R so that 



/ ^F(x / ,x // )^(i?-x / ')dx // |<C'^f[[^ 1 ( Xil ) + ...iV^(x i 



Proof. Recall the definition of I' and I" from just before the statement of the Propo- 
sition. Using Proposition 16.101 to write J[</9 / ]t-/(x', x")i/j(R ■ x") dx." = [G / ]t-'(x / ), we 
have 

/ AV(x', x.")iP(R ■ x") dx" = [ [v'h'&i x"W(R ■ x ") 

ieFcE n 

Each Q 1 is normalized relative to <p. We write the sum over I G F as an iterated 
sum as follows. Let 

E 1 = {/' = (i h , . . . ,z,J G Z a I (z 1; . . . ,z n ) G F} , 

and for I' E E±, let 

£ 2 (I') = {/" = (i mi , . . . , i mp )} G 1? I (z'i, . . . i n ) G F C • 
If 7 G F, we write / = (/', I") with /' G £1 and I" G F 2 (J'). Then 

/ K F tf, x")iP(R ■ x") cZx = ^ [ ^ e^'- 7 ")" 
J J'eEi l"eE 2 {l') 

We must show that this sum satisfies the differential inequalities for flags on the 
space IR Qi i © • • • © IR a '" , with constants independent of the finite set F. This will 
follow from Proposition 16.91 provided we can show that for each I' G E±, the sum 
Y^i"eE 2 (i') ®^ ' T ^ converges to a normalized Schwartz function. However, this follows 
from the estimates in part (j2J) of Proposition 16.101 □ 



fx') 



J /' 
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We now turn to the proof of Theorem 16.81 As already indicated on page 1331 just 
before the statement of Proposition 16. 9\ part (1) is an immediate consequence of 
Proposition 16.91 and Proposition I6.11[ so we only need to establish part (2). Let 
ip G iS(lR n ), and for each / G S n let </r G «S(R ) have weak cancellation relative to 
I. According to the second of the two Remarks 15.61 we can write 



and so 



ai 



[/] / = ^(2^^)[^] / + 2-^--)[^]x. 
i=\ 

Then integrating by parts, we have 

/ [/(x)W(x)dx=-£/ 2^[^] / (x)^(x)dx+ / 2-^>[^(x) dx. 
Jr n J^l Jr n ax i Jr n 

Thus if F C E n is a finite subset, and Kf(x) = ^/ g i?[v j/ ]-r( x ), we have 
Kf(x)i()('x) rfx 



«i 



leF 



E / I E 2 * 1 *^*) tr( x ) rfx + / I E 2 " £(i2 " n) ^o]/ ^(x)rfx 



/eF 



where 



V/ i^(x)|^(x)dx+ / ^°(x)^(x)rfx 



/4(x) = E2' ild 1^]/(x), l<<< ai) 

l£F 

E2- efe - n) [^]/. 

/GF 



If a = . . . , «tv) G N^, we have 

<9 a i^(x) = E 2^2-^=1 i ^* + ^9 a [^](2- r • x), 
/gf 

<9 a K°(x) = ^2- £ ( I2 - !l) 2-^i i '^ + ^¥[^](2- ; ■ x). 

/GF 

It follows from Proposition 111.11 in Appendix II that (at least if de < Qi + 
|<9 Q i^(x)| < C7 JVi(xi)* II[Ni(xi) + • • • + iV,(x J )]- (Qj+[Qjl) , and 
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\d a K° F (x)\ < C A^i(xi) € (iV 1 (x 1 ) + iV 2 (x 2 ))- £ n^i( x i) + ■ • • + iV i (x 7 -)]~ Wi+Ia,1) - 

j'=i 

The functions on the right hand side of the last two inequalities are integrable on 
WL N . The proof of (2) then follows from the dominated convergence theorem: 




where 

K e (x)= ^2^[^] / (x), l<t<a lt 
iee n 

^°(x) = ^2-^--)[^],. 

6.3. Rewriting sums of bump functions with weak cancellation. 

It follows from Theorem 16.81 that a sum of dilates of normalized bump functions 
with weak cancellation converges to a flag kernel, and it follows from Theorem 16.11 
that a flag kernel can be written as a sum of dilates of normalized bump functions 
with strong cancellation plus a sum of flag kernels adapted to strictly coarser flags. 
It follows that a sum of dilates of functions with weak cancellation can be rewritten 
as sums of dilates of functions with strong cancellation relative to coarser flags. In 
this section we give a direct proof of this fact. The basic idea is to use telescoping 
series to replace a function with weak cancellation by a sum of functions with strong 
cancellation plus an error term which belongs to a flag which is coarser than the 
original flag. Aside from its intrinsic interest, we shall need this observation in the 
forthcoming paper [NRSWlT] . 

Thus consider the standard flag Ta on 1R of step n associated to the decomposi- 
tion 

(A) R N = R ai ©•••©M a ™. 

Any strictly coarser flag JFq >- J 7 ^ then arises from a decomposition 

(B) R N = R bl © • • ■ © R bm 

where m < n and each M bi = M. ar i © • • • © IR as ? where 1 = r±, n = s m , rj < Sj for 
1 < j < m, and r^+i = sj + 1 for 1 < j < m — 1. As usual we let 8 n denote the set 
of n-tuples of integers / = (z'i, . . . ,i n ) with i\ < ■ • ■ < i n . For any strictly coarser 
flag T& of step m < n as above, we let S& denote the set of m-tuples of integers 
J = (ji, . . . ,j m ) with ji < ■ ■ ■ < j m . Given the argument for part (j5J) of Theorem 
16. 8\ we shall only concern ourselves with finite sums, and thus will not need to worry 
about convergence questions. 
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Proposition 6.12. Let Tj± denote the standard flag associated to the decomposition 
R N = R ai © ■ • ■ © R an . Suppose that 

(x) = J> X ],(x) (6.14) 

ie£„ 

is a finite sum, where each ip 1 is a normalized bump function for the flag T_a, with 
weak cancellation relative to I G S n with parameter e. Then we can write 

*(*) = ££foB]j (6-15) 

B> A Jes B 

where the outer sum is taken over decompositions equal to or coarser than A, and 
each r\ J B is a normalized bump function which has strong cancellation relative to the 
flag J-jg associated to the decompsition B. 

Proof. We argue by induction on the number of steps n in the original flag A. For 
n — 1 there is nothing to prove, because then there is no distinction between weak 
and strong cancellation. 

Thus suppose the proposition has been established for all flags of step less than or 
equal to n — 1, and consider the flag of step n corresponding to the decomposition 
A. The inductive step itself requires an induction. Let the function K be given by 
f)6.14p . Since each ip 1 has weak cancellation with respect to / with parameter e, we 
can write 

<p* = ^2 2~ e ^ B{jl+1 - jl) r] I s 

5c{l,2,...,ri-l} 

where the sum is over all subsets S of {1, . . . , n — 1} and each 77^ is a normalized 
bump function which has integral zero in each multi- variable x r with r ^ S. Thus 
we have 

K=Y^ £ 2-^^1+1-31)^. ( 6 .i 6 ) 

IeSn SC{1,2,...,71-1} 

We will prove by induction on k, for 1 < k < n, that K can be written 

K=J2 £ 2' £ ^Ui+i-n)[ v ' s ] I+ K B , (6.17) 

l££„ SC{l,2,...,n-k} ByA 

where the functions {^5} and {K B } have the following properties. 

(i) Each normalized bump function rjg has integral zero with respect to each vari- 
able x r with r ^ S. 

(ii) For each B >~ A, (i.e. for each decomposition B strictly coarser than A and 
hence whose corresponding flag has step strictly less than n), the function K 
can be written as a finite sum 

Je£{B) 

where each 9g is a normalized bump function with weak cancellation with some 
parameter e' > relative to the flag arising from the decomposition B. (It 
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follows from the induction hypothesis that each such function can be rewritten 
as a sum of dilates of normalized bump functions with strong cancellation.) 
(iii) The bump functions {t] ! s } and the are uniformly normalized relative to 
the normalized bump functions {y? 7 } defined in (16.141) . 

Clearly equation (16. 161) gives the desired conclusion in (16.171) for k — 1. Moreover, 
when k = n the set S must be empty and thus we will have written K as an 
appropriate sum of dilates of normalized bump functions with strong cancellation. 

Thus we turn to the induction step. Suppose that equation (I6.17P holds for a given 
k < n. We must show that (16 . 1 7[) also holds with k replaced by k + 1. We split 
the first sum into two parts depending on whether or not the subset S contains the 
element n — k: 

Je£n le£ n B>A 

SC{l,2,...,n-k} SC{l,2,...,n-(k+l)} 
n-keS 

= K 1 + K 2 + K 3 . (6.18) 

Now K 2 + K 3 are already of the form in (16.171) with k replaced by k + 1 , so we only 
need to deal with K\. 

Thus let n — k e S C {1, 2, . . . , n — k}, and consider the corresponding term rj = 7]$ 
in K\. Let Q n -k denote the homogeneous dimension of the space IR a "- fe . Then the 
function 

(t7s)'(xi, . . . ,x n ) = 2 Q "- fe ?7s(xi, . . . ,2 • x n _ fe , . . .,x n ) - rj J s {^-u ■ ■ -,^n-k, •••, x n) 

has cancellation in the variables x r for all r ^ S' = S \ {n — k}. Note that 5" C 
{1,2, ... ,ii — (k + 1)}. Let J = (ji, . . . , jn). Using a telescoping series we have 

Jn — k-\- 1 Jn—k 

^(x) = Yl 2- iQn - k (r) J s)'{xi, ■ ■ ■ , 2" 4 • x n _ fc , . . . , x re ) 

i=l 

_|_ 2~(-?™- fc + 1— 3n—k)Qn-kyjJ 2 _ ^ n_fe + 1 ~3n—k ) . X X ) 

and hence 

jn-k+1 

lv's}j = \.^s)'\(ji,---jn-k-i,i,3\-k+\,---,^) + ^ ■ (6-19) 

i=Jn-k+ 1 

We regard the last term as associated to the coarser flag of step (n — 1) associated 
to the decomposition 

(B) R ai © IR a "-<=-i © []R a "- fc © K a «-fc+i] © ]R a «-fe+2 © . . . © 
where ]R a ™- fc © ]R a «-fc+i is now considered one factor. If we set 

J{ (jl) • • • i jn— k— 1) jn— k+li • • • ijn)i J {jli ■ ■ ■ > jn— k— lj jn— k+1 1 ■ ■ ■ ijn)i 
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then G S n for j n _ k + 1 < i < j n -k+i and J n ~ k G Moreover, rjg has weak 

cancellation with respect to the flag (£>) with parameter e. Formula (I6.19P then 
becomes 

jn-k+1 

ivth = E tfos) V"" + foff]jK=J • (6-20) 
Applying the identity (16.201) . we obtain 

K 1= J2 E 2- £ ^es(Ji + i-ji)^j 

JeS n SC{l,2,...,n-k} 
n-keS 

= E E a-^t-w^-^i^^jij 

J6£n SC{l,2,...,n-fc-l} 

jn — k~ 1 (n — k) 

= E E 2- £E ^ S U{„- fc} ; + l-i i ) ^ [(^" )'],(»-*) 

Je£n SC{l,2,...,n-fc-l} i=i„_ fc+ i 

+ E E 2-«S« e SU { n- fc ,W« + l-i«)^J !: = S ] j5 - I 

Je£„ SC{l,2,...,n-fc-l} 

= Ei + E 2 . 

In Ei we change the order of summation, grouping together all the terms for which 
Jf 1 ^ is a given J' = . . . ,j' n ) G £ n . Clearly, this condition forces J to differ from 
J 1 only in its component j n -fc, an d we have j' n ^ k _i < j n -k < j' n -k- m or der to 
express the factor 2~ e ^-' !eSu {' i - fe } ( ''' i+1 ~' : '^ in terms of J', we split the summation over 
the subsets S into two parts; the first consists of subsets S not containing n — k — 1 
and the second consists of the subsets which do contain n — k — 1. We then have 

Ei 

= E E ( ' £ 1 a-^-o i^'y] j; ) 

J'S£„ 5C{l,2,...,n-fe-2} i=i^_ fc _i 

+ E E 2-^ !eS (^ 1 -i i ') 2 -%- fc+1 -iU- 1 )( ^ 

J'e£ n SC{l,2,...,n-fc-2} i=j' n _ k _ 1 

— Ei ; i + Ei ;2 . 

Each function appearing in E^i has integral zero in each variable x r with r ^ B C 
{1, . . . , n — A; — 2}, and each function appearing in Ei 2 has integral zero in each 
variable x r with r ^ B U {n - k — 1} C {1, . . . , n — k — 1}. All the rf in the above 
formula are normalized relative to the initial data. Hence, the term in E^i indexed by 
(J 7 , S) contains a function fj s normalized relative to the initial data, and multiplied 
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by a factor 

i' t,-i 

J n — k 

2- £ E ieS (i i , + i-i ; ') 2" e(j "-fc+i^ ) < 2~ e ^'6s0'i'+i-ii') ; 

in—k— 1 

whereas the corresponding term in Ei )2 is a bump function Vsu{n-k-i} norm alized 
relative to the initial data and multiplied by 

2- £ E i6S 0' i ' + l-J;')2- £ 0n-fc-in-fc-l)f 7 ' / ^ < 2" £ '^i6SU{n-fe-l}(i;'+l-ii') 

\Jn~k Jn—k—U ~ > 

with e' < e. Hence, 

^=e e 2-^^-^^'],,, 

J'G£„ SC{l,2,...,n-fc-l} 

and Ex together with gives the first sum in (16.171) with k replaced by k + 1. 

It remains to prove that £ 2 can be absorbed in the remainder term (second sum) 
of (I6.17p . We group together the terms at the same scale and separating the sets 
S containing n — k — 1 form the others. Indexing the elements J' of as J' = 
■ ■ ■ ,? n -k-i7 fn-ki ■ ■ -Jn-i) e £ n-i, we have 

E E 2- £ ^- {n - fc} (i i+ i^) [77 ^ ]j _ 

Je£ n SC{l,2,...,n-k-l} 

= E E ( E 2-^^^ fc >^^)hr fc ] Jn ^) 

J'e£„-l SC{l,2,...,n-k-2} JeSn'.J^^J' 

+ E E ( E 2- £ ^^u { „-,- 1 ,„-, } (im-,-0^] j _^ 

J'G£ n _i SC{l,2,...,n-fc-2} j eSn :J^ =J/ 
= ^2,1 + S 2j 2 • 

As in the previous discussion, each term in parentheses is a function normalized 
relative to the initial data, multiplied by a factor controlled by 

2- £ E ieS 0'i+i-iO 2~ e(j "-fc~^ < 2~ e ^ i GsO'i'+i-Ji) 

J n— fc — 1 

for the terms in S 2 .i and by 

2- £ ^ies(ii+i-h)o~ e (j' n -k-i' n -k-i) ( j' — j> ) < 9~ e '^iesu{n-fc-i}0'i'+i-ii') 

\Jn—k Jn—k—U ~ ' 

with 5' < e, for the terms in S 2 2 . It follows that 

s 3 = E E 2-^ 16S a; +1 -« K '] jM 

J'G£ n -i SC{l,2,...,n-fc-l} 

and this concludes the proof. □ 
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6.4. Restricted cancellation conditions. 

Our next result shows that the cancellation conditions in the definition of a flag 
kernel can be relaxed. As usual, let J 7 denote the standard flag 

(0) c R an c IT"- 1 © R a ' 1 c • ■ • c R a ' 2 © • ■ ■ © R an c R N . 

Theorem 6.13. Let JC be a distribution in «S(R ) which satisfies the conditions of 
Definition \2. Si for the flag J- ' , except that in condition (jbj). the values of the parameters 
Ri, . . . ,R S are restricted to R± > R2 > • • • > R s - Then K. is a flag kernel. 

Before giving the formal proof, let us explain what needs to be done. We are 
asserting that if the cancellation conditions in part (jb)) of Definition 12.31 are satisfied 
when Ri > R2 > • • • > R s , then they are satisfied for all values of the scaling 
parameters. To do this we fix a non-empty subset / C {1, . . . , n}, and a constant 
Pi > for every i G I. For each % G / we choose ipi G C^°(M ai ) equal to 1 on the 
iVj-ball of radius 1 and supported on the iV^-ball of radius 2, and with bounds on the 
norms {||^||( m )}- We set xj = (xj) i€ j and x^ = (xj)^/, and put 

Let /C* = /C* (xj) = (JC, ty p ) denote the distribution in the variables x^ such that 

(K.y , if) = (/C, © (f) for every test function if in the variables Xj. We must then 
prove the following: 

Suppose that JC satisfies the hypotheses of Theorem \6.13[ Then 

(a) If I = {l,...,n}, then \(IC, < C where C is independent of the choice of 
{pi} and {ipi}. 

(b) If I is a proper, non-empty subset of {1, . . . , n}, let io = min{j \j ^ J}. Then 
ICy p coincides with a smooth function for x io 7^ 0, and for every multi-index a, 



dZ>Ki,M'l)\ < C*l[ 




(6.21) 



10 10 
l<i 

where {C a } are independent of the choice of {p{\ and {ipi}. 

In proving (jaj) or (jbj), we may assume that K has compact support, and look for 
non-restricted cancellation estimates that only depend on the constants in Definition 
4.1, and not on the size of the support. For general /C, the conclusion will then 
follow by a limiting argument, based on the following construction. We fix a C^°- 
function ip on the real line, equal to 1 on a neighborhood of the origin, and set 
$ = f © ■ ■ • © f G C™(R N ), $ r = $ o 5 r -i. Then fC r = $ r /C has compact support, 
satisfies the hypotheses of Theorem 16. 131 uniformly in r, and lim^oo fC r = /C in the 
sense of distributions. 
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Proof. For % G J, define Ri as 



i?j = max pi . (6.22) 



Then R\ > R 2 > • • • , and so by hypothesis /C* R satisfies the estimates in (I6.2ip . 
Thus it suffices to prove that the difference /C* — /C* R does as well. 

Denote by 1°, (respectively I + ), the set of % G / such that Ri = p iy (respectively 
Ri> Pi). Setting 77, (x*) = -0i(Pi ■ x*) - -0i(-Ri ■ Xj), we have 



iei° 

Y\ 4>i(Ri ■ Xj)J f (^(-R; • Xj) + 77i(Xi)) - • Xj 

n^(^-xi)) ^ (n^(^))( n 



iei° 9^jci+ ieJ iei+\J 

0^JC/+ i£J ieI\J 

Fix JC J+ J ^ 0. By definition of 7° and J + , for each i £ J, Ri = pi for some 
/ G 1°, / > «. Set i = min{Z <E 1° : I > i and i?j = p;} and J = {i : i G J}. By the 
cancellation of JC in the variables Xj for z G I \ (J U J), 



/cf, p ( X ;) - /cf^) = ^ (x:f, (n^)(n^°^)) 



JUJ 



where each /C* is a distribution in the variables Xj with i G C I U J U J, satisfying 
condition (jaj) of Definition 12.31 

Notice that this pairing can be expressed as an integral because the right-hand 
side / in the pairing above is supported where /CJ is smooth. To see this, denote by 
io the smallest element of C I U J U J. Then K.f is smooth for x io 7^ 0. On the other 
hand, if x = (xj)j gc/uju j G supp/, all coordinates Xj are bounded away from zero 
except for those in J. But every element of J is strictly larger than some element of 
J, therefore io G" J. 

We estimate the a-derivative of each JCf by 

|9°;/C*(X'„ XJ ,X,)| 

<c (n^(x i r*)(n(E jv ^))" Q ')n(E jv 'W)^ w . (6 - 23) 

Ki 
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where Ji = {i G J : i = 1} = {i G J : Ri = Ri}. Since the Ji form a partition of J, 
we have 

ic*>' f) - k* r (x',)| < c(n(E iv '( x '))" <3, " M ) e • 



i0/ 20/ 0^JC7+ ZgJ 

Ki 



With denoting the cardinality of J/ 



/ / {\[Ni{xi)- Qi ){Y< N ^) 

1 , , i i&Ji i£Ji i&Ji 

{A r i(x i )>/J ; _ ,VieJi} Ni(xi)<Rf 

=R7 Qi J (u N ^y Qi )(J2 N ^y Ql 



{Ar i (x,)>/?r 1 ,ViGJi} 



i&Jl i&Ji i&Ji 



< CR~ Ql ] ! Ni{* t y Q ^ Qllmi dx, < c 

Ar- 7. J 



This concludes the proof. □ 

6.5. Invar iance of flag kernels under changes of variables. 

We study the effect of a change of variables on the class of flag distributions. 
If K G S'(R N ), then formally ()C,if)) = f RN AT(x)^(x) dx, where AT is the 'kernel' 
associated to /C. Let F : R — >■ R be a diffeomorphism with inverse G. We want 
to define a new distribution /C* which is the composition of /C with the change of 
variables F. Now formally 

</CoF,^>=/ tf(F(x)Mx)dx= / K(y)^(G(y))det(JG)(y)^ 

where JG is the Jacobian matrix of G. Thus if ip*(y) = tf>(G(y)) det(JG)(y), and 
if the change of variables has the property that if) G <S(R ) implies G <S(R ), we 
can define K* = K o F by setting </C # , ^> = (£, for all if) G «S(R JV ). 
We are primarily interested in changes of variables of the form 

F(xi, x N ) = (xi + Pi(x), . . . , x n + Pjv(x)) 



„ a fc-i 



where P 1 ,...,P N are polynomials of the form P fc (x 1) . . . , xjv) = Yl c k 
with coefficients c£ G R. Thus P& depends only on the variables {x±, . . . ,Xk-i}, and 
this guarantees that F is a diffeomorphism with inverse G of the same form. In 
particular, det(JG)(x) is a polynomial. Thus if if> G S(M. N ), then det(JG) if) o G G 
«S(R ), so /C* is well-defined. We want to show that if JC is a flag distribution 
adapted to the decomposition M. N = M. ai © • • • ©M a ", then /C* has the same property. 
In order to show this, we need to make additional assumptions on the coefficients 
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Definition 6.14. A change of variables y = P(x) of the form y k = x k + Pk{x-) where 
p fc( x ) = Eaee fc c t x T ■ ■■Xk-i 18 allowable if 

fe-i 

B k = |(ai,...,a fe _i) gN^ 1 ) ^a 3 -d 3 - = d fc }. 

3=1 

Theorem 6.15. Let K, G (S^R^) &e a /Za<7 kernel adapted to the standard flag T 
coming from the decomposition M. N = M ai © • ■ ■ © lR a ™ . If y = P(x) is an allowable 
change of variables, thenlC^ = K,oF is a flag distribution for the same decomposition. 

Proof. We can assume that /C = ^2 Ie£ l^ 7 ]/ where each (p 1 is a normalized bump 
function having strong cancellation. Let / G E^. We consider the dilate [</2 7 ]/ 
composed with an allowable change of variables. We have 

[<A(F(x)) = [</],(. . . , x t + P t { Xl , xl,), ...) 

= 2-Sf=i4^/( . . . , 2-^' [z, + P,( Xlj . . . , . . . ) 

Put 

5 J (x) =/(..., x, + 2-* <I P,(2 <,1<1 a; lj . . . , 2 di - lii - 1 x z _ 1 ), . . . ) 
so that [# J ]/(x) = [</]j(F(x)). Put 

P/(x) = 2~ dlil P l {2 1 • x) = ^ 2- dii!+ ^=i^^^ c^ 1 • • -Xfc*! 1 . 

Then I (x) = tp 1 (x\ + P/(x), . . .,xn + Pjv( x ))- Since the change of variables is 
allowable and I G £„, we have 

z-i z-i 



fc=i fc=i 



It follows that each P/ is normalized relative to p, and this shows that [if I ]j(F(x)) = 
[^ 7 ]/(x), where 9 1 G C^°(lR Ar ) is normalized relative to (p 1 . 

Next we study the cancellation properties of 9 1 . If we can show that each 9 1 has 
weak cancellation relative to the multi-index I G S n , it follows from Theorem 16.81 
that 

£of=5>vf=5> j ] / 

is a flag kernel, which is what we want to show. 

To do this we use Proposition E73 Let {1, . . . , n} = A U B with A n P = 0. We 
study 



y J (x A ,x B ) dx B 



J fcefl K° fc 
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= j </(.-., xi + 2- d ^'P l (2 d ^x 1 , . . . , 2 d '-^x l . 1 ), ...) d* B . 

Since (p 1 has strong cancellation, we can write it as a sum of terms of the form 
dj 1 ■ ■ ■ djjp 1 , where each index ji G J\. It suffices to consider the integrals 

J d h ■ ■ ■ 9 jn ^(- ■ ■ , x l + 2- d ^P l (2 d ^x u . . . , 2 d * i *x jr , . . . , 2 dl ~^x l . 1 ), ...) dx B . 

Let r G B. In this last integral, replace the term 

2~ dm P l (2 dlil x 1 , 2 d ^x jr , 2 dl - li '- 1 x^ 1 ) 

for I > j r by the term 2~ dlk Pi{2 dlh x u . . . , 0, . . . , 2 d '- li '- 1 xi^ 1 )] i.e. we set x jr = 
everywhere in the integrand except where it appears by itself in the j l r h entry of 

d h ■■■fyn'P 1 - Now 



d n ■ ■ ■ d jn <?(. . . , x x + 2- d ^P l (2 d ^x u . . . , 0, . . . , 2*-^'-^ I _ 1 ), ...)dx B = 



since one of the variables we integrate is Xj r , and we are integrating the derivative 
of a Schwartz function. Thus it suffices to estimate the integral of the difference: 

d h ■ ■ ■ d jn tf(. .., Xl + 2- d ^P l (2 d ^x 1 , . . . , 2 d * i *x jr , . . . , 2 dl -^x l . 1 ), ...) 



8 n ■ ■ ■ d Jn ft(. .,!, + 2^ d ' l 'P l (2 d ^x 1 , . . . , 0, . . . , 2 d '-^x l . 1 ), ...) 



dx.B- 



A typical term in the polynomial P; has the form Cqi" 1 • • -xf^ 1 where ot\di + • • • + 
= d\. Thus we get a 'gain' whose size can be estimated by a sum of terms 
of the form 

I |9~di«i+ai(2i«iH hcti_i<ij_iii_i I lai |„ loj-i 

\^a\^ l-W— 1| 

where Oj r > 0. However, 

-c^z + ^2 a td t k = [-di + y~] a t dt]ii + ^ a t d t (i t - z'z) 
t=i t=i t=i 

i=l 

< —a r d r (ii — i r ) < — e(z r+ i — i r ). 

< 2~ <E (*'-+ i ~ iT ') 



Thus for every r G B we have shown that | J ffi Rafe 6 |/ (x j4 ,x B ) o?x B 
Thus with a smaller e we have 



/ J ( Xil ,x B )dxJ < TT2 



e(i r +i— i r ) 
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and it follows from Proposition 15.71 that 9 1 has weak cancellation. This completes 
the proof. □ 

7. Convolutions on nilpotent Lie groups 

7.1. Homogeneous nilpotent Lie groups. 

We now begin the study of operators /—>•/* K, where /C is a flag kernel, and 
the convolution is on a homogeneous nilpotent Lie groug G with Lie algebra q. 
To say that a Lie group G is homogeneous means that there is a one-parameter 
group of automorphisms 5 r : G — > G for r > 0, with 5i = Id. As a manifold, 
G is an iV-dimension real vector space, and we assume that with an appropriate 
choice of coordinates, G = M. N and the automorphisms are given by 8 r [x] = r • x = 
(r dl xi, . . . ,r dN xn) with 1 < d\ < di < • • • < 4v- We begin by summarizing the 
facts about group multiplication, invariant vector fields, and group convolution that 
we need in this context. Additional background information, and in particular the 
proofs of formulas (17.11) and (17. 2p below, can be found in the first chapter of (FS82J. 

The product on G = R is given by a polynomial mapping; if x = (x l7 . . . ,xn) 
and y = (yi, . . . , y^), the k th component of the product xy is given by 



(xy) fc = x k + y k + M fc (x, y) = x k + y k + ^ c^if • • • x^yf 1 ■ ■ ■ y^ 1 ( 7 ^ 
where {c^} are real constants, and 

fc-i 

M k = {(a;/3) = (a u . . . ,a k -i]Pi, • ■ -,Pk-i) E d *( a * + A) = 4}- 



Note that M k (r ■ x, r • y) = r dfc M fc (x, y). 

Next, let {Xi, . . . , X^} and {Yi, . . . , Yjy} be the left- and right-invariant vector 
fields on G such that at the origin, X k = Y k = d Xk . Then 



di>d k di>d k 

* = £ + E £ = £- t +t E ~<*?- ■ *S' l t 

l=k+l K l=k+laeSj d d 

di>d k d t >d k 



(7.2) 



where {a kl } and {a k } are real constants, and the index set S) d is defined in Propo- 
sition [2J3 It follows that P kn P kl G 'Kd l ~d k - 

The bi-invariant Haar measure on G is Lebesgue measure dy = dy\ ■ ■ -dy^. The 
convolution of functions f,g G L 1 (G) is given by 

f*g(x)= [ f(*y~ 1 )g(y)dy= [ /(y^y^x) dy, 

JG JG 
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and the integral converges absolutely for almost all x £ G. The following result can 
be found on page 22 of [F552] . 

Proposition 7.1. Let f,g e C 1 (G)f]L 1 (G). 

(1) If X is a left-invariant vector field and Y is a right invariant vector field, then 
X[f*g] = f* X[g] and Y[f * g] = Y[f] * g. 

(2) If X is a left-invariant vector field and Y = X is the unique right-invariant 
vector field agreeing with X at the origin, then X[f] * g = f * Y[g}. 

(3) If 5 = Sq denotes the Dirac delta-function at the origin, then (p(x) = ip * S(x) = 
S * ip(x) for ip G Cq°(G). In particular, if X is a left-invariant vector field and Y 
is a right invariant vector field, X[ip] = ip * X[S] and Y[tp] = Y[S] * tp. 

Using the formulas in (17. ip . we can write the convolution of integrable functions 
/ and g as 

f*9{x)= f(---,x m -y m -P m (x,y),...)g(... 1 y m ,...)dyi---dy N (7.3) 
Jr n 

where each P m is a polynomial in the 2m — 2 variables {x\, . . . , x m _i, yi, . . . , y m -i} 
satisfying P m (2 K ■ x, 2 K ■ y) = 2 femdm _P m (x, y). In the formula ( 17.30 . the variables in 
x appear in the argument of /. However by a change of variables we can move some 
or all of them to the argument of g. Thus if S is any subset of {1, . . . , iV}, we can 
write 

Mjv(x,y)) 0(vi(x,y), • • .,v N (x,y)) dy x ■ --dy N (7.4) 

x m - Vm - <5m(x, y) iimeS, 
y m if m (£ S, 

: (7.5) 

Dm lime S, 

x m - y m - <5m(x, y) if m (£ S. 

Here each Q m = Q m is a polynomial in the variables {x±, . . . , x m _i, yi, . . . , y m -i\ 
with the same homogeneity as P m ; that is Q m (2 K ■ x, 2 K ■ y) = 2 femdm Q m (x, y). 

7.2. Support properties of convolutions [ip]j * 

In this section we study the support properties of the convolution of dilates of 
normalized bump functions with compact support. Given integers i,j G Z, we set 
i V j = max{z, j}. Given iV-tuples I = (ii, . . . , ijy), J = (ji, . . . , Jn) £ % N , w e set 

lVJ = (hVji,...,i N Vj N ). (7.6) 

We want to show that the convolution [tp]j * is the I V J-dilate of a normalized 
function. 



f*9(x) = / /(«i(x,y), 
Jrn 

where 

« m (x,y) 
«m(x,y) 
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Lemma 7.2. Let tp,ip £ C™(M. N ) have support in the ball B(p). Then for any I, J £ 
E N there exists 6 £ C™(WL N ) supported in the ballB{Cp) such that = [@]ivj, 

and ||0||( m ) < Cm.|| < / ? ll(m)||^'||(m)- The constants C and {Cm} can depend on the 
radius p, but are independent of the functions <p and ip. 

Proof. Let K = I V J and put 

e=[[tp]i*[ii>]j]_ K . 

It suffices to show that 9 is supported in the set {x £ M. N \ \xk\ < Cp, 1 < k < N} and 
that ||^||(m) < C m ||<p||(m)||^ll(m) for some absolute constants C and {C m }. Making 
the change of variables y m — > 2 kmdm y m and using the homogeneity of the functions 
{u m } and {v m }, we have 

[[<p]t*M]-kW 

= 2 E ™ dmfcm ([<p] / * [^] J )(2 dlkl x 1 ,...,2 dNkN x N ) 

= 2 J2 m 2d m k, n f [ V9 ] / (... ;Um ( 2 ^^a; 1 ,...,2 d ^x 7 v,y),...) 
Jrn 

[xjj] j( . . . , v m (2 d ^x u 2 d » k »x N , y),...)d yi ---dy N 

= 2T, m d m ( 2 k™-im~jm) / (f(..., 2*™ ^ U m (x, y) , . . . ) 

Jr n 

2 dm{km - jm) v m (x, y), . . . )d yi ■ ■ ■ dy N . 

Note that 2^™"^ > 1 and 2 d ^ km ~ j ^ > 1. It follows that if [<p r * ^j]-a'(x) ^ 0, 
there exists y = (yi, . . . , yjy) £ M. N so that for 1 < m < N, 

| Mm (x,y)| < 2 d ^ k --^\u m (^y)\ < P, 
|u m (x,y)| < 2 d ^ km -^\v m (^y)\ < p. 

We show by induction on m that these inequalities imply that \x m \ + \y m \ < A m p 
for an appropriate choice of constants A\ < A2 < . . . < An- When m — 1, we have 
\xi — Ui\ < P and < p, so |a;i| + 1 2/1 1 < 3p, and we can take Ay = 2. Next, 
assume by induction that |a; s | + \y s \ < A s p for 1 < s < m. Since Q m (x.,y) depends 
only on the variables {x\, . . . , x m -i, y±, . . . , y m -i}, it follows that |Q m (x, y)| < B m p 
where B m is a constant that depends on the coefficients of the polynomial Q m , on the 
constants {A s } for s < m, and on p. We have \y m \ < p and \x m — y m — Q m ( x , y)| < P, 
so \x m \ + \y m \ < (5 m + 2)p. This completes the proof of the statement about the 
support of 9. 

To establish the estimate ||0||( m ) < C m \\<p\\(m)\\'4 , \\(m), we again use formula (j7.4p . 
but this time with the set S = {m £ {1, . . . , N} | j m < i m = k m }. Of the two factors 
|2 c M fc >»-«™) 5 2 dm( - km ~ jm ^ : the one which equals 1 multiplies the expression x m —y m — 
Q m (x, y), while the term y m is multiplied by the larger factor 2 dm( - km ~ ( - lmAjm ^ . Thus 
with this representation of the convolution <fii the integration takes place over 
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the set E = {y G R N | \y m \ < 2 dm( - km (»»» A Jm)) } Thus we can estimate the size of 
H'* by 

|[^*^U(x)| <2^^( 2fc -" i --^||^||o||^||o / dy <C |M|o|H|o 

since 2k m —i m —j m — k m +{i m /\j m ) = 0. When we take derivatives of [[y]/*^] j] _ x ( x )> 
the terms involving the variables x are multiplied by the factor 1, and so we obtain 
in the same way the estimate 

\\[<Pi* fa] 

This completes the proof. □ 

7.3. Decay and cancellation properties of convolutions [tp]j * 

We want to study the decay and cancellation properties of the convolution [tp]j * 
[ip]j under the assumption that tp has cancellation in the variables {xi ± , . . . ,xi a } 
and ip has cancellation in the variables {x mi , . . . ,x mb }. Here decay means that the 
size of [ip] i * [ifj\j is small due to the difference between the iV-tuples I and J; 
cancellation means that the integral of [tp]j * with respect to some variables 
is zero. (See Section 15.21 and Definition 15.31 for the precise definition of strong and 
weak cancellation). Before stating our results, let us see what we should expect by 
considering the much simpler case in which the convolution [</?]/ * e is taken with 
respect to the Abelian (Euclidean) vector space structure of R rather than the 
general homogeneous nilpotent Lie group structure G. 



Let I = (ii, . . . . 


In) and J = 


(ji, ■ 


. .,j N ), and put 




A 


= {se{l,.. 


. ,a} 






A 1 


= {se{l,.. 


. ,a} 


\k > Jis] = {!,•.■ 


,a} \ A , 


Bo 


= {te{l,.. 


■M\ 


jm t — i"mt } ) 




Bx 


= {te{l,.. 


■,b}\ 


jm t > i mt } = i m l. 


. . . , m b } \ B 



Because of the hypothesis on cancellation, we can write 

f = d h - --di a (p, 
tf) = d mi --- dm^. 



For each s G Aq we can integrate by parts in the variable xi s in the integral [v 7 ] J r*e [^] j, 
moving the derivative di s from (p to ip. Since the width of the dilate [<p\i is narrower in 
this variable than the dilate [ip] j, this integration by parts gives a gain of 2~ e ^ l «~ H «\ 
and we get such a gain for each s G A . A similar argument shows that we get a 
gain of 2~ e ( lOT * -J ' m *) for each t G -Bo- Thus the total gain from integration by parts 
is IIseAo 2~ e ^~ H ^ YlteBo 2~ e ^ lm *~^ m *\ In addition to this gain, we observe that in 
the convolution [ip]j * e [ip]j, the derivatives di g for s £ A± and d mt for t G B-y can be 
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pulled outside the integral. The final result is that there is a compactly supported 
function 9, normalized relative to <p and ip, so that 

\<p\i*mj= n 2 ~ £( ^ j n 2 " £(imf ~ jmt) n n ^^w. 

seA teBo s'eAi t'eB 1 

In other words, we get exponential gains from variables where there is cancellation 
for the function with 'narrower' dilation, and the resulting convolution still has can- 
cellation in the remaining variables. 

When dealing with convolution on a homogeneous nilpotent Lie group, we cannot 
move Euclidean derivatives from one factor to the other. However, we can write 
Euclidean derivatives in terms of left- or right-invariant vector fields which can be 
moved across the convolution. But this process introduces error terms involving 
derivatives with respect to 'higher' variables, and these come with a gain involving 
the differences between entries of I or J. Thus in the case of nilpotent Lie groups, 
we might hope that convolution results in three kinds of terms: gains of the form 
2~ e \ l e-je\ coming from integration by parts in narrow variables with cancellation, 
residual cancellation of the convolution in some variables which are not used in the 
integration by parts, and finally gains of the type 2~ e<y%l+1 ~ % ^ and 2~ e ^ e+1 ~^ e \ This is 
in fact the case, and is made precise in the next Lemma. 

Suppose we are given two decompositions 

(A) : R N = M. ai ©•••©R a ", 

(B) : R N = R bl ®---®R bm . 

Let { Jj 4 , . . . , j£} be the indices corresponding to (^4) and let { Jf , . . . , J^} be the 
indices corresponding to (I). Define 

cr : {1, . . . ,N} ->• {1, . . . , n} such that I £ jf^, 

r : {1,...,N} ->■ {1, ...,m} such that / £ Jf (z) . 

In what follows, ir^ and 7Tb denote mappings from the set {1, . . . , N} to itself with 
the property that n^(£) £ J^m, and 7Tg(7) £ J^L. Also recall from equation (I4.5[) in 
Section 14.11 that we can introduce mappings pj± : £ n -» E N and : £ m — > E N so 
that 

PA(h,---,in) = (h, ■ ■ ■ ,h , *2, • • • M , ■ ■ ■ , i n , ■ ■ ■ ,in), 

bi b m 
PB(jl, ■ ■ ■ ,jm) = (jl, ■ ■ -,jl , J2, ■ • • , h > • • • > 3m, ■ • ■ ,3m)- 

Lemma 7.3. Suppose that tp £ C^°(lR Ar ) has cancellation in the variables for 
£ £ A C {1, . . . ,N}, and that ip £ C^°(lR Ar ) has cancellation in the variables xi for 
I £ B C {1, .. -,N}. Let I = (ii, ...,i n )eS n and J = . . . ,j m ) £ S m . Set 

A = {£ £ A | i a{e) < j rW }, 
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B = {I E B\j T Q) < V ( z)}. 

Then [tp]i * can be written as a sum of terms of the form 

£ 2~ e (M £ )+i~M<?)) J J 2 -e ( iT M+ 1- M ! )) J £ 2 _e ^ T W _i<r ^^ J J 2~ e ^ CT ( ! )~ : ' T ( ! '- ) 
«eAi ieBi teA 2 zeB 2 

[ n d **w n ^8(o e ] P ^(/)v P8 (j) 

l£A 3 1<=B 3 

where 9 is normalized relative to tp and ib, A = A\ U A2 U A3 and B = B\ U B2 U B3 
are disjoint unions, and we have A2 C Aq C Ax U and B2 C I?o C -Bi U B2 . (We 
will have a(£) 7^ n and r{l) 7^ n). 

Proof. Using the cancellation hypotheses, it follows from Lemma 15.11 that we can 
write 

v = {U d *)M ^ = (n (2*^^))^/, 

i&A leA 

* = (n^)t^]' W j = (n i^y)^, 

where if a is normalized relative to tp, and is normalized relative to ib. We can 
use Corollary 14. 81 to write [cp]j as finite sums of terms of the form 

( n 2 " £(M " +i_iiw) ) n^^z^Mi 

where Ai C A is a possibly empty subset, = A\Ax, each Z WA ^ is either the 
corresponding left- or right-invariant vector field, and ip^ is normalized relative to 
ip. Moreover, according to the Remarks 14.91 following Corollary 14.81 the operators 
{Z nA ^} can be put in any desired order. Similarly [tp]j is a finite sum of terms of 
the form 

' Y[ 2- 6 OV(o + i-mo)) Y[(2^smMi)Z 7VBm )[ i p Bl } J . 

It follows that [</?]/ * is a finite sum of terms of the form 

2~ e(iCT («)+ 1 ~ i<T W ) 2™ e( - ?T ( ! )+ 1 "- ?T ( i ' ) 

II (a^^^w)!^]/ * II (a^^w^Ki))^] 

Now let 

A 2 = e Ai I i CT( £) < = A x n A , 1 2 = e Ai I > j rW } = Ai \ A 2 
5 2 = {£ e Bx I j T (o < i a {i)} = BxnB , B' 2 = {£ e Bx I j T (o > = B X \B 2 
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We choose Z^ and Z\ as follows: 

'Li H£eA 2 Z ={ Ll if/G ~ 2 

R e if £ e A 2 ' 1 \Ri ifleB 2 ' 



Then since left-invariant vector fields commute with right invariant vector fields, we 
can use Proposition I7.1[ part flTJ and then Proposition 17.11 part (j2J) to write 

= J](2^>M^ W ) J](2^ 8 (0^(0L WB(/) ) 

£&A 2 l£B 2 
£&A 2 leB 2 

Mi* H^Ai^L^) H(2 d ^Mo RnB{l) )^ Bi 



£€A 2 leB 2 

where (fA 1 and , ipB 1 are normalize^ relative to ip and ip. (Here rifeA; ^ s actually the 
product of the operators in the reverse order). Now we want to commute the op- 
erators H^a^aW^w Li) and UieBo( 2 ^ BwjTW R i) before applying them to [<p Bl ]j- 
According to the third of the Remarks 14. 9[ the result is a sum of terms of the form 

£<=A 3 lGB 3 leB S l ^ A 3 

where 

A 3 c A' 2 , A 3 = A' 2 \ A 3 , 

B 3 C B' 2 , B s = B' 2 \ £? 3 . 

Thus [p Al ]i * UteA^^^Lt) UieB^^^RiMB,}.; is a sum of terms of the 
form 

£&A 3 l€B 3 

ieB 3 £&A 3 



The function tpA-L also depends A and also depends on B. With minimal risk of confusion, 
we shall omit such notation. 
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J(2'W-Wi?,)[ ¥ > i4l ] J * ] J(2^wM0i^)[^ Bs ] 

£eA s ieB 3 'eB 3 te^3 

B 3 ^3 

J 2 _e (Mf)+i-M<o) J J 2 _e ^ T ( ! '+ 1 ~'' T ( i )^ J J 2 _d ' r B( I )^ r ( i )~'' T ( i )) J J 2~ <i ' r .4w(- ?CT (' ? >~ iCT W) 

tel 3 igB 3 fe^s 



ies 3 teA 3 



Now according to Lemma \7.2\ we can write 

[JI [ JJ R^b 3 ] = [Oa 1 ,b 3 }k 
b 3 teA'{ 

where Oa 1 ,b 3 is normalized relative to tp and ip and K = I V J. Thus it follows that 
[(f\ 1 * is a finite sum of terms of the form 

2 _e (v(f)+i- i CT (<?)) J 2~ e 0V(o+i~ i-r(o) J J 2 _a! ' r ^^'^ CT (' ? ' _i<T ( £ '- > J J 2~ d ' r s( i )(^(o _ -'V(i)) 

II (a^^^w) I] ^-^^(oM^aW. 

e&A 2 i&b 2 
However, it follows from part ([I]) of Corollary 14.81 that we can write the product of 



vector fields Uiel {^aW^wR^ f] ~ {2 d ^mMi)L l )[B Al> B 3 \iyj as a sum of terms of 



the form 

n 2- e ^M+ i -M^ n 2- e ^(o+i-jvw) jj(2^w^w^(£)) n( 2 ^ 8( ° jT(o,9 -8(o)[^4,B4]/vj. 

«eAt zeB 4 ^ 6A 4 (eB4 

Thus we have shown that [tp] j * is a sum of terms of the form 

2~ e (v(<')+i-V(£)) 2 -e ^ r ( ! )+ 1- -M ! )) J J 2~ d ' r ^( f '^ tT (' ! '~ i<T ( f ^ J J 2~ d7r e(o(*T(o~iT(i)) 

n (2^ w *- w ^w) n (2^ (i) ^ (i) ^ B( o)[^ 4 ,i ?4 ]/vj. 

teA 4 2eB 4 
This has the form asserted by the Lemma, and so completes the proof. □ 



Corollary 7.4. Let tp, ip e C^°(lR Ar ) /iai>e strong cancellation relative to the same 
decomposition R N = R ai © • ■ ■ © M a ". There exists e > so that if I, J £ £ n , it 
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follows that [tp]j * is a finite sum of terms of the form 
where: 

(a) the set {1, . . . ,n} is the disjoint union of the sets A and A, and of the sets B 
and B; 

(b) each function 9a,b is normalized relative to tp and ip. 

Proof. Let J? denote the set of subscripts £ such that x £ G WL ae , and let a : {1, ... , iV} — > 
{1, . . . ,n} be the mapping such that a(£) G Ji for all £. Since cp and ip both have 
strong cancellation relative to the decomposition M. N = M. ai © • • • ©IR an , if A and B 
are the sets defined in Lemma [7.31 it follows that {o~(£) | £ G A U B } = {1, . . . , n}. 
This means that the sets A3 and B% of that Lemma must be empty, and the result 
follows. □ 

7.4. Truncated flag kernels. 

Definition 7.5. A flag distribution K, adapted to the decomposition M, N = IR ai © 
• • ■ © M. an is a truncated kernel of width a > if the differential inequalities given in 
part figjj) of Definition \2.$ are replaced by 



\%K(x)\ <C,Hla + TV^xO + • • .7V fc (x fc )]- 



-|L<*fcJI 
fc=l 

/C z's an improved truncated kernel is a truncated kernel, and in addition satisfies 



\%K(x)\ < C Q Hla + Nfa) + • ■ ■ N k (x k )]- Qk -^ . 

a + YVi(xi) ^ 

Our objective is to establish the following: 
Proposition 7.6. Let K be a flag distribution. 

(1) If ip G C^°(IR Ar ) nas support in the unit ball, then K, * ip and ip * JC are truncated 
flag kernels of width 1. 

(2) If ip G C^°(IR Ar ) /ias support in the unit ball, and if J RN ip(x) dx = 0, iaen then 
JC * ip and ip * K, are improved truncated flag kernels of width 1 . 

Proof. We can write K = JZiesA^^ 1 + Y^j=i^-r, where {ip 1 } are normalized unit 
bump functions with strong cancellation, and {ICi, . . . , K. r } are flag distributions 
adapted to coarser flags. If /C = Yliee l^ 1 }^ it suffices to show that the proposition 
is true for /Co- We consider ip * JCq. The case of /Co * ^ is handled similarly. We have 

V> * £o(x) = ^ * 
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where 





J(x) 


+ E /J ^ x ) 


+ ///(x) 






fc=i 






= U 


= (-1, • • • ,«n) 


G £n | -n < 0}, 


771 fe 


= u 


= (ii, ...,i n ) 


G | h < < ifc+l}, 


El 


= {/ 


= (ii, •••,-») 


G £ n < 



Denote the element (0, . . . , 0) G £ n by 0. If J = (ii, . . . , i n ) G then i\ < i 2 < 
• • • < -it < 0, and we put 



Then 



4 = (0, . . . ,0,i fc+ i, • • • An)- 

if/ef", 

I V = { T k if 7 G 

1 if / G £+ 



Let 



= {/ G £ re I Z! 



ifc = 0}. 



Note that if I G then 4 G ^ . According to Lemmas 17.21 and 17.31 each term 
ip * [</- J ]j has weak cancellation. Moreover, for each I G £ n there exists 9 1 G C™(M. N ), 
normalized with respect to i/j and if 1 , so that 

7e£- =-> V*[/]/ = 2" e(|n|+ - +|i " l) [^]o 

lef* ^*[ v - / ]- = 2- e d il i + - + i ife i)[^] 7fc (7.8) 

leS+ =► ^ *[</], = [0^. 

We have 

J(x) = E 2- e(|il|+ - +|i " l) [^] (x) = ^ 2 - e(|il|+ - +|i " l) ^(x) = 0°(x) 



ies n 



where the series converges to a normalized unit bump function 9°. Next, we can 
write 



j/ fc (x) = Y * * [/m*) = E [ E ^ * [A 



<\h\+-+\i k \)[Ql] I 
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= E [E 2 " e(|ill+ ''^ r ]/ x )= E 

where 

2~<\^\+-+\ik\)()i = qJ 

h = J 

converges to a normalized unit bump function. Thus each 11^ is a flag kernel with 
a decomposition into dilates of normalized bump functions where all of the dilation 
parameters are non-negative. This is also true of the term III. 
Thus we have shown that 

where 

E+ = {I = (i h ...,t n )eZ n \o<t 1 <t 2 <---< i n }. 

But then it follows from the second inequality in Proposition 111. lj, as in the proof of 
Proposition 16.91 that ip*K.Q satisfies the differential inequalities of a truncated kernel 
of width one. 



If we assume in addition that J RN ip('x) dx = 0, then we can write ip — Yli=i & 
with 

ipkix-L, Xi^ u t, x t+1 , ...,x n )dt = 0. 



We can repeat the argument given above with ip replace by ipi. It then follows from 
Lemma 17.31 that instead of the formulas given in equation (17. 8p , we get 



iee- ^*[/]/ = 2- e(lnl+ - +|i " l} [^]o 

leS% => ^* [f I ]i^2~<\ i ^ + - + \ i ^[B\ if k<l 

Ie£* ^*[^ / ] / = 2-^ i 2- e(|il|+ - +|ifel) [^] 7fc ifk>l 

Ie£+ ft *[^]i = 2-^ [9%. 



(7.9) 



Again using the second inequality in Proposition |11.1| and observing that the case 
/ = 1 gives the worst estimate, we see that ipi* )Cq satisfies the differential inequalities 
of an improved truncated kernel of width one. □ 

Remark 7.7. We point out that we can relax the C°° requirement on the function ip 
in Propsition 17.61 in the following way. An examination of the arguments in Sections 
17.11 and 17.31 and the proof just given show that for any integer m there exists an 
integer M, so that K * ip and ip * JC satisfy the required differential inequalities and 
cancellation properties for orders of differentiation not exceeding m, if ip is supposed 
to be of class C M . 
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8. Convolution of flag kernels 

Let /C G S ; (R N ) be a flag distribution on the homogeneous nilpotent Lie group 
G = R N , adapted to the standard flag associated with the decomposition R N = 
R ai © • • • © IR a '\ Define a left-invariant operator T K : S(R N ) -> C 00 ^) by setting 

Tk[0](x) = 0*/C(x) = (/C,0#> 

where, if G «S(R ), we set 0* (y) = 0(xy _1 ). If /Ci and /C 2 are two flag kernels on 
G, we want to make sense of the composition T/c 2 o , and show that the resulting 
operator is of the form T^ 3 where /C3 is a third flag kernel on G. Now formally 

T K2 o Tjc, [0] = {T Kl [0]) * /C 2 = (0 * K x ) * /C 2 = * (/d * /C 2 ), 

so the operator T^oT^ should be given by convolution with the distribution /Ci*/C 2 . 
However we cannot directly define the composition T^ 2 oT^j[0] = TJq^TJcJ^]), even 
if G C^°(IR Ar ), since TJcJ*/*] need not belong to S(M. N ). Also, in general one cannot 
convolve an arbitrary pair of distributions unless one of them has compact support. 

We will define the convolution /Ci*/C 2 somewhat indirectly. In Section IHTTI we show 
that if G S(R N ), then T K [</>] G L 2 (R N ) and the mapping T K : S(R N ) ->■ L 2 (IR Ar ) 
has a (unique) continuous extension to a mapping of L 2 (IR Ar ) to itself. This allows 
us to define T^ 2 o as the composition of two mappings from L 2 (R N ) to itself. 
Then in Section 18.21 we show that this composition is given by convolution with 
a distribution which is given as sum of convolutions of dilates of bump functions. 
The key is then to recognize this sum as a flag kernel. The combinatorics are rather 
complicated, so in Section 18.31 we work out an explicit example. In Section 18.41 we 
prove the main result, Theroem I8.4[ which shows that the convolution of two flag 
kernels is a sum of flag kernels. Finally in Section 18.51 we work out some additional 
examples. 

8.1. Boundedness on L 2 . 

In this section we show that convolution with a flag kernel extends to a bounded 
operator on L 2 (R N ). Later in Section[9]we will see more: such operators are bounded 
on L P (R N ) for 1 < p < 00. 

Lemma 8.1. Let K be a flag kernel on R N . Then there is a constant C so that if 
Tjc[4>] = * /C for G S(R N ) then \\Tjc[<f>]\\i^(vN) < C \ \(fi\\ L 2( R Ny As a consequence, 
there is a unique extension ofTjc to a bounded operator from L 2 (R N ) to itself. 

Proof. Using Corollary I6.2[ we can assume that K, is a flag kernel adapted to a stan- 
dard flag T of length n as given in equation (12. 4p . and that there is a uniformly 
bounded family of functions {tp 1 } C C^(R N ), each having strong cancellation rel- 
ative to the decomposition R N = R ai © ■ • • © R a " such that /C = Xl/e.f jv^]/ with 
convergence in the sense of distributions. 

For any/G S n let Tj[f] = f * [</],. Then ||T 7 [/]|| L2 < ||</|| Ll \\f\\„. If (f,g) = 
J RN /(x)p(x) <ix is the standard inner product in L 2 (R N ), it follows from Fubini's 
theorem that (7} [/],#) = (f,T T [g]) where Tj[g] = g* and ^(x) = y^C*" 1 )- 
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Thus the Hilbert space adjoint of the operator T^i is the operator T* = T^i. Fubini's 
theorem also shows that Tj o Tj = T^i^j and Tj o Tj — T^i^j. Thus if /, J G £ n , 
the L 2 -norm of the operators Tj o TJ and TJ o Tj are bounded by the L 1 norms of 
[(f J ]j * [v? 7 ]/ and [y? 7 ]/ * [<p J ]j- It follows from Corollary 17.41 that 

\\W\J* MAIl^r") + 1 1 Mi * Mj||i,i(kW) 

n.-l 

< C2~^ in ~ in ^ Yl [2~ Ak ~ jA +m\n{2~ e{k+1 ~ ie \2- e{k+1 ~ k) }]. 
l=\ 

(8.1) 

For any finite subset F C E n , set /Cp(x) = Xl/eF^M*)- Then for any G 
«S(R iV ), (/C, 0) = linip ^E n (JCf, 0), and in particular, if 0* (y) = 0(xy _1 ), 

DcM(x) = </C,0#) = lim </C F ,0#) = lim * [</] 7 (x) = lim £>M(x). 

F/E n F/E n FyE n ^ p 

It follows from the almost orthogonality estimate in (18.1 p and the Cotlar-Stein The- 
orem (see for example [Ste93j . page 280) that there is a constant C independent of 
the finite set F such that 



ieF 

But then Fatou's lemma implies that | |2~5c[0] | |i 2 < C ||0||i 2 for all G S(WL N ). This 
completes the proof. □ 

Corollary 8.2. Supposet that K is a flag kernel, and that K = [v 7 ]/- Then 

for allfe L 2 (R N ), 

M llE r M^- r ^ll^ = - 

Proof. Since S(M N ) is dense in L 2 (M N ), and since is bounded on L 2 (M N ), it 
suffices to show that liin^ yE n \\ TlieF^Wh^] ~Tk[ 1 P]\\l' 2 = for ip G S(WL N ). (Both 
T)c and YIigf -^h? 7 ]/ are bounded on L 2 {R N ) with norm independent of F). But for 
ip G S(M. N ), the result follows from Theorem 16.81 and the discussion following it on 
page [33j This completes the proof. □ 

8.2. Composition of convolution operators. 

Let JCj, j = 1, 2, be two flag kernels on Mr, and let TJc [0] = * JCj be the cor- 
responding convolution operators. According to Lemma I8.1[ each of these operators 
is bounded on L 2 {R N ), and hence the composition Tjc 2 o Tj^ is well-defined as a 
bounded operator on L 2 (R N ). Our main result is the following. 

Theorem 8.3. Let T\,Ti be two standard flags on M , and let J-" ^ e the coarsest 
flag on Mr which is finer than both T\ and T-i- For j = 1,2, let fZj be a flag kernel 
adapted to the flag Tj. Then T^ 2 o is a flag kernel adapted to the flag J-q- 
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In order to study the composition T^ 2 o , we want to relate it to the decompo- 
sitions of /Ci and IC 2 as sums of dilates of normalized bump functions. According to 
Corollary 16.21 we can assume that the flags T\ and F 2 are given by 

T\ ■ (0) C R an C IR "- 1 © IR a ™ C ■ • • C lR a2 © ■ ■ • © IR a " C R N , 

T 2 : (0) C R hm C IR 6 ™- 1 © IR 6 " 1 C • • • C R h2 © • • • © IR fem C R N , 

and the flag kernels are given by /Ci = YlieS if 1 }! an< ^ ^2 = Sje£ m [V' J ] J> where 
{y^ 1 1 £ £ n } is a uniformly bounded family of compactly supported functions with 
strong cancellation relative to the flag J 7 ! and {tp J \ J £ S m } is a uniformly bounded 
family of compactly supported functions with strong cancellation relative to the flag 
T 2 . 

If 0,0 G S(R N ), then 




where the limits are in L 2 (R N ) and are taken over finite subsets F C S n and G C S m . 
For every fixed finite set F C 5 n , the function 5^ /gF 0* [v^ 7 ]/ £ «S(M jV ). Since T^ 2 is 
a continuous mapping from L 2 (R N ) to itself, it follows that 




Thus in order to prove Theorem l8.3l we must study the finite sums YlieF XljeG^ 7 ]^ * 
["0 J ]j, and show that these converge in the sense of distributions to a finite sum of 
flag kernels, each adapted to a flag which is equal to or coarser than J-" - 

Since the general situation is rather complicated, we first present an example which 
may help understand the difficulties. 

8.3. An Example. 

Suppose that we are working in IR 5 with the family of dilations given by 

6-x=6-(x 1 ,x 2 , x 3 , x 4 , £5) = (6 dl xx, 5 d2 x 2 , 5 d ' s x 3 , 5 di x 4 , 5 d5 x 5 ) (8.2) 

with di < d 2 < d 3 < (I4 < The standard flags on R 5 correspond to partitions 
of N — 5. Consider two flags T\ and T 2 corresponding to the partitions 21 = (2, 3), 
where we write IR 5 = IR 2 © IR 3 , and <8 = (1, 2, 2), where we write IR 5 = IR © IR 2 © IR 2 . 
Thus 

7i is the flag (0) C {x x = x 2 = 0} = IR 3 C IR 5 , 

T 2 is the flag (0) C {x x = x 2 = x 3 = 0} ^ R 2 C {x x = 0} = IR 2 © IR 2 C IR 5 . 
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We are given flag kernels K,i = J^i^^l 1 anc ^ ^ 2 = ^Je£ 3 bP^J adapted to these 
two flags. We then want to study the infinite sum 

EE^vt^ (8.3) 

arising from the composition of the operators Tjci o T)e 2 . 

Suppose / = (ii, i 2 ) G £ 2 and J = {ji,j 2 ,h) G £ 3 , so that i x < i 2 , and ji < j 2 < J3- 
If </, ^ J G ^(R^), we have 

[^ 7 ]/(x) = 2~ il{dl+d2) - i2{d3+d4+d5) ip I (2~ dlh x 1 ,2~ d2il x 2 ,2~ d3i2 x 3 ,2~ d4i2 x^2- d5i2 x 5 ) 

[i) J ] j(x) = 2^ ldl -^( d2+Q!3 )-^( Q!4+d5 ) i() J (2- dljl x 1 , 2~ d2j2 x 2 , 2~ d3j2 x 3 , 2-^ 3 x 4 , 2- d5j3 x 5 ). 

Note that dilation by I = (ii,i 2 ) on IR 2 © IR 3 is the same as dilation by the 5-tuple 
/ = {i\,i\,i 2 ,i 2 ,i%) on IR 5 , and dilation by J = (31,32,33) on IR © IR 2 © IR 2 is the 
same as dilation by the 5-tuple J = (ji, j 2 , j 2 , j%, j'3) on IR 5 . Also note that we can 
reconstruct / and J from / and J by consolidating repeated indices. By Lemma \7.2\ 
the convolution [ip^i * [ip J ]j is equal to [6 i,j ]k where 9 I,J G C^°(IR 5 ), and where 

K = (k 1 ,k 2 , k 3 ,h, h) = IV J= (ii,ii,i2,i2,i2) V {ji,j 2 ,j2,h,h) , g ^ 

= (max{z'i,j'i},max{ii, j 2 }, max{i 2 , j 2 }, max{z 2 , j 3 }, rnax{z 2 ,i 3 })- 

We must consider the sum in (I8.3P of the convolutions [y? 7 ]/ * [V ;J ]j, taken over 
all / G £ 2 and J G £3. Each pair (J, J) G £ 2 x £3 gives rise to a 5-tuple K G £5. 
However, not all elements of £5 actually arise in this sum. (For example, it is clear 
from (18.41) that we must have fc 4 = k 5 , so the 5-tuple (1,2,3,4,5) does not arise). 
Let £(21, 53) denote the set of all 5-tuples K — (hi, k 2 , k 3 , /c 4 , k 5 ) that do arise as in 
(18. 4p . (The notation reflects the fact that this set of 5-tuples is determined by the 
partitions 21 = (2,3) and 53 = (1,2,2) of TV = 5). Then for each K G £(21,53), let 
£{K) denote the set of pairs (/, J) G £ 2 x £3 which give rise to the 5-tuple K. Once 
K G £(21, 53) is fixed, each of the terms in the inner infinite sum on the right-hand 
side of (18.51) is the K dilate of a normalized bump function 9 I,J . Then we can write 
the sum in (I8.3P as 

EEiA*[^= E E [^]/*[rt/= E E p'-v 

ie£ 2 Je£ 3 Ke£(%<B) i,Je£(K) Ke£(%<B) i,Je£(K) 

(8.5) 

We will need to show that the infinite inner sum J2i je£{K)W' J \K actually con- 
verges and is the K dilate of a normalized bump function. However, this is not 
enough to give the right description of the sum in (18.31) as a flag kernel. In the outer 
sum on the right-hand side of (18.51) . the 5-tuple K runs over £(21,53) and not over 
all of £5. We still need to partition £(21, 53) into a finite number of subsets based on 
which indices in K are repeated. To make this clear, we further analyze £(21, 53). 

As one sees from (18. 4p . the 5-tuple K = I V J depends not only on the tuples 
I — {h, ^2} an d J = {ji, j 2 , J3}, but also on the ordering of the larger set consisting of 
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{h, hjji, J2j J3}- We know that i\ < i 2 and ji < j 2 < j 3 , but this does not determine 
the ordering of the larger set. Such orderings are in one-to-one correspondence with 
decompositions of the set {1,2,3,4,5} into two disjoint subsets of sizes 2 and 3, 
where elements of the first set are indices from J, and elements of the second set are 
indices from J. Thus there are (2) = (3) = 10 such orderings. A description of these 
is given in the following Table 1: 

Table 1: Decomposition of £(21, *B) 



Decomposition 


Ordering 


K 


New Decomposition 


Free 


£(a,B)i = {l,2}U{3,4,5} 


ii <i2 < ji < 32 < 33 


{ji,32, 32, 33, 33} 


R R 2 ® R 2 


ii, «a 


£(2l,S)2 = {l,3}U{2,4,5} 


ii < ji < 12 < 32 < 33 


{31,32,32,33,33} 


R R 2 © R 2 


11,12 


£(21,<B) 3 = {1,4}U{2,3,5} 


ii < ji < 32 <i2 < js 


{31,32,12,33,33} 


R ® R ® R ® R 2 


h 


£(2l,<8) 4 = {1,5}U{2,3,4} 


ii < ji < 32 < 33 < «2 


{jl,j2,«2,«2,«2} 


R © R © R 3 


ii,33 


£(SC,!8) B = {2,3}U{1,4,5} 


ji < i\ < i2 < 32 < 33 


{ii, 32, 32, 33, 33} 


R © R 2 © R 2 


in, h 


£(St,«8) 6 = {2, 4} U {1,3, 5} 


ji < ii < 32 < 12 < 33 


{ii,32,i2,33,33} 


R © R © R © R 2 


3i 


£(SC,«8)r = {2,5}U{l,3,4} 


ji <ii < 32 < 33 < 12 


{ii,ia,i2,*a, 12} 


R © R © R 3 


3i , 33 


£(2l,23) 8 = {3, 4} U {1,2, 5} 


Ji < 32 < h < 12 < 33 


{11,11,12,33,33} 


R 2 © R © R 2 


ji , 32 


£(3,23)9 = {3, 5} U {1,2, 4} 


ji < 32 <ii < 33 < 12 


{ii,ii, «2, 12, 12} 


R 2 ©R 3 


jl ! 32 , 33 


£(0,!8)io = {4, 5} U {1,2, 3} 


ji < 32 < js <il <%2 


{«l,il,«2,«2,«2} 


R 2 fflR 3 


jl , 32 , 33 



In the first column, we have given the decomposition of {1, 2, 3, 4, 5} into two subsets, 
the first with two elements and the second with three. This then gives an ordering 
of the elements in the set 13, 14, 15} which is given in the second column. The 

third column gives the value of the 5-tuple K = I V J. In this tuple, certain entries 
can be repeated, and this corresponds to a new decomposition of 1R 5 . The fourth 
column gives this new decomposition of M. 5 dictated by the repeated indices of K. 
Finally, in each of the decompositions, certain of the indices from I or J appear in 
the 5-tuple K. In the sixth column of Table 1, we list the 'free'-variables which do 
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not appear in K are listed. It is precisely these free variables which appear in the 
inner sum on the right-hand side of equation (18. 5p . 

Table 1 shows that if K G £(21, 03), then K takes one of five forms: 

(ki, hi, k 2 , k 2 , k 2 ) (decompositions 9 and 10) leading to the flag IR 2 © IR 3 , 

(hi, k 2 , k 2 , k 3 , k 3 ) (decompositions 1, 2, and 5) leading to the flag IR © M 2 © IR 2 , 

(ki, k 2 , k 3 , k 3 , k 3 ) (decompositions 4 and 7) leading to the flag IR © IR © IR 3 , 

(ki, ki,k 2 , k 3 , k 3 ) (decomposition 8) leading to the flag IR 2 © IR © IR 2 , 

(ki, k 2 , k 3 , k^ k^) (decompositions 3 and 6) leading to the flag IR © IR © IR © ! 

(8.6) 

The outer sum on the right-hand side of (18. 5p thus splits into five separate sums: 

E5>v^ j ]j= E E E E bviv^ 

Ie&2 Je£z Ke£ 9 U£i IJ^S(K) A'e£iU£ 2 U£5 I,j££(K) 

+ E E bVt^+E E tf\i*w J \j 

Ke£ 4 U£ 7 ij(z£(K) K ^ E s I,Je£(K) 

+ E E i^*w J h- 

Our object is to show that these five sums are flag kernels, each adapted to one of 
the five flags listed on the right-hand side of (18.61) . To see this, we must show that 
in each case, the inner infinite sum converges, and has weak cancellation. Let us see 
why this happens in one case. 

Case 1: K e E 9 U E w . 

In this case, K = {ii, i 1 , i 2 , i 2 , i 2 } is fixed, and the inner sum je,?^)^]/* [ % l ]J ] J 
is over the free variables {ji, j 2 ,j 3 } which satisfy the inequalities 

ji < 32 < h < h < iz or ji < j 2 < ja < k < k- 

In order to apply Theorem 16. S\ we need to check that the sum converges to the 
i^-dilate of a normalized bump function 8 I,J , and moreover that 8 I,J has weak can- 
cellation relative to the decomposition IR 2 © IR 3 . 

To show that the sum over the free variables {ji, J2, J3} converges, we want to 
show that each term in the sum can be bounded by 2 — e [^ 1— J1 ^ + ^ 2 ~ j2 - >+ ^ 3— ■ 73 ^ where 
h,h,h ^ {^2,^2}, and ji < h, j 2 < l 2 , and j 3 < l 3 . This will follow because, by 
hypothesis, ip J has strong cancellation relative to the decomposition IR © IR 2 © IR 2 . 
Thus ip J has cancellation in x%, in either x 2 or x 3 , and in either £4 or £5. In the 
variable x±, j\ < i±, and so by Lemma [7.31 we g e t a gain of 2 -e l n-J ' 1 l < 2~ e ' n ~- ?2 L If 
there is cancellation in x 2 , we have j 2 < z'i, so we get a gain of 2- e l <1 -»l, while if 
there is cancellation in x 3} we have j 3 < i 2 and so we get a gain 

f 2-e|<a-is|. Finally, 

if there is cancellation in x 4 or X5, we have j 3 < i 2 , and so we get a gain of 2~ e '* 2 ~-' 3 l 
Taking the best of these estimates, we see that the size of [(p J }i * [?p J }j is dominated 



FLAG KERNELS ON HOMOGENEOUS GROUPS 



66 



by a constant times 2 e " n ^l+l* 1 wl+ta J3 1] _ Thus in this case we can take l± = i\, 
l 2 = ii, and l 3 = i 2 . 

The key points in this convergence argument are the following: 

(a) If f s is a free variable, it does not appear in K. Since the entries k s G K are the 
maxima of the corresponding entries of i s G / and j s G J, the free variable must 
satisfy f s < k s . 

(b) Since we will sum over the free variable f s , but the variable k s G K is fixed, it 
suffices to show that there is a gain 2~ e ^ ka ~^ a \ 

(c) The function ip 1 or ip J with the free variable f s may not necessarily have can- 
cellation in the variable x s . (For example, the free variable j 2 comes from the 
function ^Cji>jaja,i3,j3) ) anc j we on iy know that this function has cancellation in 
the variable x 2 or the variable £3). However, if there is no cancellation in the 
free variable, there is a smaller free variable where there is cancellation, and 
where the corresponding element of K is the same. (In our example, ^O' 2 ^ 2 ^ 3 ) 
has cancellation in x\, and k\ = k 2 ). 

To see that the sum of the terms [y? 7 ]/ * [4 jJ ]j has weak cancellation relative to 
the decomposition IR 2 © IR 3 , we again use Lemma T7.31 We only need to observe that 
either (p 1 or ip J has cancellation in one of the variables {xi,x 2 }, and also that either 
ip 1 or ip J has cancellation in one of the variables {x%, £4, x$}. But this is clear: for 
example, ip J has strong cancellation relative to the decomposition IR©IR 2 ©IR 2 , and so 
has cancellation in x\, and (f 1 has strong cancellation relative to the decomposition 
IR 2 © IR 3 , and so has cancellation in one of the variables {X3, £4, x§}. 

8.4. The general decomposition. 

Now let us return to the general situation. Suppose T\ and T 2 are standard flags 
arising from two (in general different) decompositions we label as (21) and (23): 

(21): R N = R ai ©•••©IT", 

(03) : R N = R bl ®---®R bm . 

Let /Ci and K, 2 be flag kernels adapted to the flag T\ and T 2 . We only need to 
consider the parts of these kernels given by sums of dilates of normalized bump 
functions with strong cancellation. (That is, for each kernel we focus on the part 
called /Co in Theorem 16.11 and disregard the other terms since they correspond to 
coarser flags). Thus we can write 

Ki = Zy]/, K 2 =Y^W\^ (8.7) 

where each ip 1 has strong cancellation relative to the decomposition 21 and each %p J 
has strong cancellation relative to the decomposition fB. Let 



FLAG KERNELS ON HOMOGENEOUS GROUPS 67 

where F C £ n and G C £ m are finite subsets. We study the double sum 

/Cf*/C 2 G = Yl E bV[^ J ]j- (8.8) 
ieFc£„ JeGceSm 

Let J-q be the coarsest flag which is finer than both F\ and Ft. 



Theorem 8.4. Let /Ci and /C 2 be flag kernels given in (8.1). Then 



lim /Cf * K° = /Ci * /C 2 

F /•£,„ 

m 

converges in the sense of distributions to a finite sum of flag kernels 5^/C M , each of 
which is adapted to a flag F^ which is equal to or coarser than the flag Fq. 

Before outlining the proof, we review our notation. If x G R , we can write 
x = (x' l5 ...,<) with x£ = (xp/ , . . . , x q G R% or x = (x'/, . . . ,x£J with x' fc ' = 

P », . . . , x q >>) G K 6fc . We let Jj = {pj, . . . , gj} and J£ = {p£, . . . , g£} so that 

{l,...,iV} = U; =1 ^ = ULi^- Define 

(r a : {1, . . . , iV} -> {1, . . . , n} so that / G J^ (J) for 1 < / < N; 

7® : {1, . . . , N} {1, . . . , m} so that I G J"^ for 1 < I < N. 

If Q'j is the homogeneous dimension of M. aj and Q" k is the homogeneous dimension of 
R b \ then 



Q'j = dp^-i — h ^ = 



If / G S n and J G £ m , the notation [y? 7 ]/ and [ , ?/ ;J ]j refers to the families of dilations 

[^ 7 ](x) = 2- [Q i il+ - +Q '" in] ip{2~^ ■ xi, . . . , 2~ in ■ 34), 

[^ J ](x) = 2-Wi' il+ - +<J ^ m V(2 _il ■ x'/, . . . , 2" J ' m ■ x'4) 

In order to compare multi-indices / = (ii, . . . , i n ) G £ n and J = (ji, . . . ,j m ) G S m 
which parameterize different families of dilations, we identify them with multi-indices 
of length N with repeated entries. Thus we define p% : £ n — > En and p<g : £ m — > En 
so that pa(-0 is the iV-tuple with %\ repeated a\ times, 12 repeated ct 2 times, etc. We 
define p<^ analogously. Thus 



Pa(J) — I — (I u . . .,I N ) = (z'x, ,ii , . . ., i n . .. 

pgj(J) = J= (Jl,..., Jjv) = ( Jl, • - • , Jl , ,j S 
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Explicitly, p a (i") = (I 1 ,...,I N ) and p®( J) = (J 1 ,...,J N ) where i, = and J t = 
j T ny Next set K = (if 1; . . . , K N ) = V p<s(J) = (ki, . . . , fcjv). This means that 

for 1 < I < N 

{jr(l) if < jr(l) 

(8.9) 
««r(0 if JV(0 < «'<r(0 

Note that we can then define a map ttjj : {1, . . . , N} — > {1, . . . , ri, n+ 1, . . . , n + m} 
so that 

{£ E {1, . . .n) if if, = v w ; 

(8.10) 
ra + £ G {n + 1, . . . , n + m} if K { = j r{t) . 

Now let us outline the proof of Theorem 18.41 For each / G S n and J G S m it 
follows from Lemma [7.21 that there is a function 8 I,J G C£°(M. N ), normalized relative 
to ip 1 and ip J , so that 

[<p I } I *ty J }j=[9 I > J } miI)VmiJ) , (8.11) 
and hence equation (18.81) can be written 

K,(*K$= [8 I,J ]p*(i)vpm(J)- ( 8 - 12 ) 

{I,J)eFxGc£ n x£ m 

We analyze this sum by decomposing the set S n x S m into disjoint subsets. Let 
*P(n, m) denote the set of permutations /i : {1, . . . , n, n+1, . . . , n+m} — >■ {1, . . . , ra, n+ 
1, . . . , n + m} which preserve the order of the first n elements {1, . . . , n} and of the 
last m elements {n + 1, . . . , n + tti}. (Explicitly, this means that if \x G ^(n,m), 
then 1 < s < t < n implies fi(s) < fi(t) and n+l<s<t<n + m implies 
fi(s) < [J<{t)). This corresponds to the ten cases in the example studied in Section 
18.31 The cardinality of ^3(n, m) is ( n ^ m )- Let I G £ n and J G £^ m , and let us write 

I = («i, ...,a n ) and J = (a n+ i, . . . , a n+m ), so that 

«i < "2 < ■ • ■ < «n and a n+ i < a n+2 < ■ ■ ■ < a n+m . 

Then let L(J, J) be the (weakly) increasing rearrangement of the set I U J = 
{«!, . . . , a n +m} so that 

if l<r<s<n then a r comes to the left of a s , 

if n + l<r<s<n + m then a r comes to the left of a s , 

if 1 < r < n, n + l<s<n + m and a r < a s then a r comes to the left of a s , 

if l<r<n, n + l<s<n + m and a s < a r then a s comes to the left of a r . 

(8.13) 

This rearrangement of {a x , . . . , a n+m } is given by L(I, J) = {a M (i), . . . , a M („+ m )} 
where /x is a permutation of the set of subscripts {1, . . . , n, n + 1, . . . ,n + m}, and 
it follows from (18.131) that \x G ^J(n, m). In this way we associate to each pair 
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(I, J) G £ n x £. m a unique \i = J) G ^(n, m). Conversely, for each /i G ^(n, m), 
let 

£n,m(n) = {(I, J) e£ n x £ rn \ L(I, J) = {a M( i), . • •,« Al(n+m) }}. (8.14) 
It follows that we have a disjoint decomposition £ n x £ m = U M e<p(m n) £n,m(/-0> an d 
we can write equation (18.81) as 

/Cf*/C 2 G = ^ ( £ [^ J k(/)v P23( j)). (8.15) 

H&${m,n) (/,J)e£„, m (At)n(FxG) 

Now let 

E n (jj) = {KeE N \ K = p<&(I) V P<b(J) where (J, J) G £ n , m (v)} • (8-16) 

Note that in general En(/j) is a proper subset of En, and an element K G E^{ix) 
can be represented in many ways as p%(I) V p<$(J) with (J, J) G £„ ;m (/i). We can 
then rewrite (18.151) as 



Kf**?= E ( E ( E 

/jeqj(m,n) KeE N {n) (J,J)e£ n ,mMn(FxG) 

p a (/)vp<8(J)=i<: 



E ( E E 



3 /,J 



,ueqj(m.,n) K&E N {n) (/,J)6£„, m (/i)n(FxG) 

Pa(-f)Vp«(J)=K 



A' 



(8.17) 



We will prove in Lemma [8.81 below that because the functions {f 1 } and {ip J } have 
strong cancellation, the innermost sum 

9 I,J = lim 0l,J ( 8 - 18 ) 

(i,J)££ n ,mM ays™ (y)6f»,m&j)n(PxG) 

P3i(I)Vpv(J)=K p°AI)vp<b(J)=k 

converges to a function A ' G Co°(R iV ) which is normalized relative to the families 
{(fi 1 } and {'0' 7 }. From this it follows from ("18. 1 T[) that 

Ki * /C 2 = hm /Cf * /C 2 G = £ ( £ [B*)A (8.19) 

G /-Em ii&${n,m) K&E N (ii) 

We will also see in Lemma [8.81 below that for each fixed /i, the functions {O^} for 
K G En (/j,) have weak cancellation relative to a decomposition of WL N depending on 
/j,, M. N = M Cl © • ■ ■ © IR Cr , and hence the inner sum on the right hand side of (18.191) 
is a flag kernel relative to the corresponding standard flag T^. Once this is done, we 
will have established Theorem 18.41 

We now turn to the details of the proof. We begin by studying the iV-tuple 
K = (Ki, . . . ,K N ) = p%(I) V p<b(J) if (/, J) G £ ntTn (fi). Partition K into disjoint 
subsets of consecutive entries where two successive elements Ki and Ki + i belong to 
the same subset if and only if either 
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(i) Ki = i a{i ) > j T( i), K l+1 = i a{ i+i) > j T (i+i), and a (I) = a (I + 1); or 

(ii) Ki = j T {i) > i«r(i), Ki +1 = j T {i+i) > ^(t+i) , and r(Z) = r(Z + 1). 

In particular, if two successive elements K\ and belong to the same subset, they 
must be equal. Thus we write 

K = {K 1 ,...,K N ) = ({K ai ,...,K fil }, {K a2 ,...,Kp 2 },...,{K ar ,...,Kp r }) 

where 

K ai = K ai+ i = ■ ■ ■ = Kp r , K a2 = K a2+ i = ■ ■ ■ = Kp 2 , ■ ■ ■ K ar = K ar+ x = ■ ■ ■ = K@ r . 
We can also write 

Ci C2 C r 

K = (Ki,...,K N ) = (ki,...,ki; k 2 , ...,k 2 ;...; k r ,...,k r ), (8.20) 

so that ci + C2 + • • • + c r = N. Note that 1 < r < m + n since either z'i or ji does 
not appear in ii'. 

We have the following properties of this decomposition. 

Proposition 8.5. The integers {a\, fix, . . . , a r , /3 r } depend only on the permuta- 
tion fi and are independent of the choice of {I, J) € £ n ,m{^)- In fact, let K\ = 
{K ap . . . , Kp t } be one of the subsets of consecutive indices in K = p%(I) V p<s(J) G 
£n,m{y)- Then: 

(1) The starting position a\ coincides either with the starting position of one of the 
subsets of p%(I) or with the position of one of the subsets of p<g(J). More pre- 
cisely, 

(PJi) If Ken = M Q i)> then a{a{) > a(ai — 1) , so the start of Ki coincides with the 
start of the index i a ( ai ) i n P%{I)- 

(Pd) If K ai = j T (ai)> then r{a{) > r(ai — 1), so the start of Ki coincides with the 
start of the index j T (ai) inp<s(J). 

(2) If the ending position of K\ does not coincide with the end of the corresponding 
segment of p%(I) orp^{J), then the entries of the segment which do not appear 
in K\ are bounded above by the corresponding entries of K. More precisely 

d2k) Suppose that K ai = i a (a t ), so that K t = i a ( ai ) for ct\ < t < /?;. Suppose 
that I t = I ai for ati < t < 7 and that 7 > Then I t < Jt — Kt for 
0i + 1 < * < 7- 

([2b) Suppose that K ai = j T (ai)> so that K t = j T ( ai ) for an < t < 0i. Suppose 
that Jt = Jai for ai < t < 7 and that 7 > fy. Then J t < It = K t for 
0i + 1 < t < 7. 

Proof. We begin by establishing part ([Taj) . If it were not true, then since j T ( at -i) ^ 
3t(ch)i we would have 

jr( ai ~l) < jr(ai) < Ma,) = Vo,-l), 
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and by condition (E]) in the definition of K\ it would follow that K ai and K ai -\ belong 
to the same subset. Part (llbp follows in the same way. To establish part (I2aj) . observe 
that since Jp t +i < Jt, it suffices to show this for t — + 1. But if J^+i < Ip t +i, 
it would follow from (JTJ) that Kp l+ \ belongs to the same subset as K a[ . Part (12bl) 
follows in the same way. 

Now parts ([I]) and (j2J) show that the positions where we decompose K depend on 
the ordering of the entries of p%(I) and p<s(J), and not on the entries themselves. 
This shows that the decomposition depends only on /i, which completes the proof. □ 

Definition 8.6. Let /x G *#{n, m). 

(1) i£ follows from Proposition \8.5\ t7ia£ i/ie permutation \x determines r and the 
integers {ci, . . . , c r }. T/ras /i determines the decomposition R N = R Cl © • • • ©IR Cr . 
We letT a denote the corresponding standard flag (0) C R Cr C ]R Cr - 1 ©]R Cr C ■ ■ • C 
rffi-ffir»a c '®-.® R c >- = R*. For x G l w , we wnie x = (x 1; . . . , x r ), 
and we /et { Ji, . . . , J r } denote the corresponding sets of subscripts so that X[ is 
a coordinate in R Ck if and only if I G Jk- 

(2) Let I = (it, i n ), J = {j u ...,j m ) G £n,m{n)- An mdex k is free if h < Jt for 
all t such that a(t) = I. An index ji is free if Jt < It for all t such that a(t) = I. 
In particular, a free index does not appear in the set K = p%(I)\/ p<s(J) G E^(ix). 

Note that whether or not an index is free depends only on the choice of \i, and not 
on the choice of (J, J) G £ n ,m(/-0- The number of free elements is equal to m + n — r, 
and 1 < m + n — r < m + r. 

Proposition 8.7. Fix \x G ^(n, m), and let (I, J) G S n _ m (fi) with I = . . . , i n ) and 
J = (ji, . . . , j m ). Let (p 1 , tp J G C^(M. N ), and suppose that ip 1 has strong cancellation 
relative to the decomposition M. N = IR ai ©- • -©IR an ; and thatip J has strong cancellation 
relative to the decomposition R N = R bl © • • ■ © R bm . 

(1) Suppose that an index i\ is free, and that i\ = I r = J r+1 = ■ ■ ■ = I s is the 
corresponding group of indices in p%(I), so that J r _i = and J s+1 = i l+x . 
Then the function (p 1 has cancellation in the variables {xi r , . . . ,xi g }, and It < Jt 
for r < t < sJl 

(2) Suppose that an index ji is free, and that {ji = J r = J r+1 = ■ ■ ■ = J s } is 
the corresponding group of indices in pgj(J), so that J r _x = and J s+1 = 
ji + i. Then the function ip J has cancellation with respect to one of the variables 
{x Jr , . . . ,x Js }, and J t < I t for r < t < s J3 



8 It follows from Lemma 15.11 that (p 1 can be written as a sum of functions each of which has 
cancellation in one of the variables {xi T , . . . , Xi g }. 

9 It follows from Lemma 15.11 that ip J can be written as a sum of functions each of which has 
cancellation in one of the variables {xj T , . . . , xj g }. 
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(3) Let W N = M 01 © ■ • • © IR Cr be the decomposition corresponding to fi, and let 
{ J±, . . . , J r } be the corresponding sets of subscripts. Let 1 < I < r. 

(a) Suppose that K ai = ■ ■ ■ = Kp t = it so that a{a{) = ■ ■ ■ = a (Pi) = I. Then 
either ip 1 has cancellation in a variables Xt with t G Ji, or ip 1 has cancellation 
in a coordinate x t with t > fli, in which case K t = J t > I t = ig. 

(b) Suppose that K ai = ■ ■ ■ = Kp t = j# so that t(oci) = ■ ■ ■ = r(j3i) = I. Then 
either^ 3 has cancellation in a variables x t witht G J\, orip J has cancellation 
in a coordinate x t with t > fli, in which case K t — I t > J t — 

Proof. To prove assertion (1), note that since %i is free, it does not appear in K. 
Hence for any r < t < s, i\ ^ max{/ f , J t } = max{^, J t }, and so Jg > ig — It- 
Since ip 1 is assumed to have strong cancellation and {xi r , . . . are precisely the 
variables corresponding to the index i r , this establishes (1). The proof of assertion 
(2) proceeds in the same way. 

To prove assertion (!3al) . note that by Proposition 18. 5[ part (flat) , the only way in 
which it is possible for cp 1 not to have cancellation in a variable x t with t G J\ is 
if I t = I ai for ai < t < 7 and that 7 > fy. But then the conclusion follows from 
Proposition 18.51 part ( l2al) . The proof of assertion ( I3bl) follows in the same way. □ 

Lemma 8.8. Fix /i G ^(n, m). 

(1) Let K G E N (n). Then there exists Q K G C^{R N ), normalized relative to the 
families {<p ! } and {ip J } so that the sum 

(m) 

converges (uniformly) to [Q k ]k- 

(2) The function Q K has weak cancellation relative to the decomposition of ¥L N cor- 
responding to fl. 

Proof. Let (I, J) G £ n , m {.n)- Suppose that (p 1 has cancellation in the variables 
{x n , . . . , x Tn } with ri G J/, and that if) J has cancellation in the variables {x Sl , . . . , x Sm } 
with Si G J/'. Let 

A = {I G {1, . . .,n} I i n < j rt }, B = {I G {1, . . . ,m} \j Sl < i Sl }. 

Then it follows from Proposition 18.71 that if i\ is a free index, / G Aq and if ji is a 
free index, then / G Bq. On the other hand, according to Lemma [7.3[ we can write 
each [y 7 ]/ * [4 jJ }j as a finite sum of terms of the form 

J 2 -^ s -^l ] [ 2- e|im '" im ' 1 ] [ 2- e| ^+ 1 "^ 1 ] [ 2~ ebm i+ 1 ~ im * 1 "[<%."[ d mt [9] 
seA' taB 1 s€A" teB" saA'" teB'" 

(8.21) 

where 



FLAG KERNELS ON HOMOGENEOUS GROUPS 



7:5 



(1) A', A", A'" are disjoint subsets of {1, ... , n} with A' U A" U A'" = {1, . . . , n}; 

(2) B', B", B'" are disjoint subsets of {1, . . ., m} with B'UB"UB"' = {l,...,m}; 

(3) A' C A C A' U A" and B' C B C B' U B". 

(4) each function # depends on {A', A", A'", B', B", B'"} and is normalized rela- 
tive to ip and ip. 

Since is fixed, the sum for p%(I) V p<&{J) = K is precisely the sum over the set 
of free indices in . . . , i n ,ji, ■ ■ ■ ,jm}, and these are contained in the indices in 
A' U A" U B' U -B". The exponential decay in the powers of 2 in equation (18.211) 
show that the sum over all the free indices converges, and what remains satisfies the 
requirements for weak cancellation in Definition 15.51 This completes the proof. □ 

8.5. Further Examples. 

It may help to consider two additional examples. 

Example 2: Suppose that N = 5, and that the two partitions of {1,2,3,4,5} 
are A = {2,3} and B = {2,3}. Thus A and B come from the same decomposition 
]R 5 = M 2 © IR 3 . There are (2) = 6 different decompositions of {1,2,3,4} into two 
disjoint subsets of cardinality 2 and 2. The six decompositions, the 5-tuples J, J, 
and / V J, and the resulting new decomposition C of {1, 2, 3, 4, 5} are listed in Table 
2 below. This example is typical of the convolution of two kernels coming from 
the same flag (in this case coming from the decomposition {2,3}). Note that all 
decompositions lead to the same decomposition {2, 3}. Thus the convolution will be 
a flag of the same type. 

Table 2 



Decomposition 


Ordering 


K 


New Decomposition 


C 


Free variables 


{1,2}U{3,4} 


ii <i2 < ji < 32 


{31,31,32,32,32} 


R 2 ©R 3 


{2,3} 


ii,i2 


{1,3}U{2,4} 


h < jl < 12 < j2 


{31,31,32,32,32} 


R 2 ©R 3 


{2,3} 




{1,4}U{2,3} 


il < jl < J2 < 12 


{ji,3i,h,i2, 12} 


R 2 ©R 3 


{2,3} 


ii,32 


{2, 3} U {1,4} 


jl < ll < 12 < j2 


{ii,ii,32,32,32} 


R 2 fflR 3 


{2,3} 


i2,ji 


{2, 4} U {1,3} 


ji <ii < ji < i2 


{ii, ii, 12, *2, 12} 


R 2 ©R 3 


{2,3} 


31,32 


{3, 4} U {1,2} 


ji < 32 < h < 12 


{il, il, Z2, 12, 12} 


R 2 ©R 3 


{2,3} 


31,32 
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Example 3: Suppose that N = 5, and that the two partitions of {1, 2, 3, 4, 5} are 
A = {5} and B = {1,2,3,4,5}. Thus m — 1 and n — 5. There are then (j!) = 6 
different decompositions of {1,2,3,4,5,6} into two disjoint subsets of cardinality 1 
and 5. The six decompositions, the 5-tuples X, J , and X V J, and the resulting new 
decomposition C of {1, 2, 3, 4, 5} are listed in Table 3 below. 



Table 3 



Decomposition 


Ordering 


K 


New Decomposition 


C 


Free variables 


{1} U {2,3,4,5,6} 


h < ji < 32 < 33 < 34 < 35 


{31,32,33,34,35} 


ReIRffiK®R®IR 


{1,1,1,1,1} 


il 


{2} U {1,3,4,5,6} 


ji <h < 32 < 33 < 34 < 35 


32, 33,34,35} 


R®RffiRffiRffiIR 


{1,1,1,1,1} 


ji 


{3} U {1,2, 4, 5, 6} 


ji < 32 < il < 33 < 34 < 35 


{il,n,33,34,35} 


r 2 e r e R e m 


{2,1,1,1} 


31,32 


{4} U {1,2, 3, 5, 6} 


ji < 32 < 33 < il < ji < 35 


{h,ii,ii,j4,35} 


r 3 e r e r 


{3,1,1} 


31,32,33 


{5} U {1,2, 3, 4, 6} 


ji < 32 < 33 < 34 <ii < 35 


{ii,ii,il,ii,is} 


r 4 e R 


{4,1} 


31,32,33,35 


{6} U {1,2, 3, 4, 5} 


3i < 32 < 33 < 34 < 35 < h 


{ii,ii,ii,ii,ii} 


R 5 


{5} 


31,32,33,34,35 



This example is typical of the convolution of a Calderon-Zygmund kernel with a 
kernel that is as fine as possible. In this case, the flag A is coarser than flag B. 



9. //-ESTIMATES FOR FLAG CONVOLUTIONS 

In this section we establish the boundedness in L P (G) for 1 < p < 00 of the 
operator / — > JC * f given by convolution on G with a flag kernel. To simplify 
the notation, we limit ourselves to the special situation where the exponents of 
the dilations d±, d 2 , • • • cZjy equation (12. ip are positive integers. The results proved 
below will go over to the more general context with essentially no change in the 
proofs. We will also find it convenient to consider a continuous parameter for 
the dilation of the Xk variable, in place of the dyadic version T h appearing in the 
previous sections. Again, the various results above stated for the dyadic dilations 
have simple modifications valid for their continuous analogues. 

9.1. Maximal Functions. 

As usual, G is a homogeneous nilpotent Lie group that we identify with M, N as in 
Section 17.11 We also let 

G k = {x = (Xi, . . . , x n ) e R N I Xx = - - ■ = X fc _! = 0} 

= {(0,--,0,x fe ,..,x n )el Ar |x i Gr,Ki<n}. 

We can identify Gk with ]R Qfc © • • • © IR a ™ , and it follows from the formula fl7.ll) for 
group multiplication that Gk is a subgroup of G. We let m(E) denote the Lebesgue 
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measure of a set E C G — G±, and rrik(E) denote the Lebesgue measure on G k of a 
subset E C Gk- For s = (s fe , . . . , s„), let 

R s = R { s k) = {(Xfc, . . . , x n ) G G fc : |x fc | < s£, . . . , |x n | < s£}. 

We say that the size of the rectangle R s is acceptable if Sk < s/c+i < • • ■ < s n . 

Definition 9.1. T/ie maximal function M, defined on G = Gi, is given by 

M(f)(x) = sup ^-jj^ \f(x-y^)\dy 

where the supremum is taken over all acceptable rectangles R s = R^ C G = G±. 
Theorem 9.2. 

(a) M is a bounded map of L P (G) to itself, for 1 < p < oo. 

(b) For 1 < p < oo there are constants A v so that if {fj} are scalar-valued functions 
on G then 

ll(I>(« 2 ) 1/2 |U, < A,||(£l//nUy 

j j 

To prove this theorem, we consider the standard maximal function Mk on the 
subgroup Gk defined by 

Af fc (/)(x) = sup * / \f(xy- l )\dy 
p >o m(B(p)) J 

B(p) 

where B(p) = B h (p) is the automorphic one-parameter ball given by 

B (k) (p) = {(**, . . . ,x n ) 6 G k | |x fc | < p fc , |x fc+ i| < p fc+1 , . . . , |x„| < p n ). 

Let Mfc be the maximal function in G obtained by lifting Mk in Gk to G. (Facts 
about lifting are reviewed in the Appendix, Section [TBI) . The key lemma is 

Lemma 9.3. There is a constant C so that 

M <CM n o M n _j o...oM x . 
Proof. Let s = (s k , s k+1 , ...,s n ) and s = {s k +i, s n ). Let 

XpW = the characteristic function of the rectangle R^ in the subgroup Gk, 

_rC s 

— the characteristic function of R^ k+1 ^ in the subgroup Gk+i, 
Xs(*)r Sfc ) = the characteristic function of the ball B^ k \sk) in Gk- 
Let t] {k), t] (k+i), r] <fc) be the normalized versions of these functions, so that, for 
example, rj„(k)(x) = m^R^)' 1 x R ( fe ) with a similar definition for 77 (*). The first 
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observation to make is that if s k < s k +i ... < s n , there is an estimate n < 
ry R (fe) * ?7o(*+i), hi the sense there are constants c, C, so that 

r) R (k) <Cr) ( fe ) * (&,. ® ?7 R (fe+i)) (9-1) 

where the convolution is now on the group G k and 8 Xk denote the delta function of 
the x k variables^ In fact, 



XB( Sk ) * (5 Xk ® = / ■ 2/ *) X R ^+i(y)dy. 

We introduce a new coordinate system in G^, so that if x G Gk, then 

x = (x fc , x fc+ i, . . . x n ) = (x fc , 0, ... 0) • (0, x' fc+1 , . . . x4) = (x fe ) ■ x', 
with x' G Gfc+i. In this new coordinate system the integral can be written as 

/ XB(s k )(x k ■ y') Xri+^y'- 1 ■ xW- (9-2) 

Now if x G R,is\ and c > is small, then |xj.| < c fc s^ and x' G ^v_ +1 , with c' small 
with c. (This is because x^- = qj(x) with gj homogeneous polynomials of degree 

j, k+1 <j < n.) So if y' G B^S then y'" 1 ' x ' e R ~ s +1 - Thus for x G r£\ 
the integrand above is 1, whenever y' G i^st+r The result is that the last integral 

exceeds m(B^ 1 i ) for x G . Dividing through by the normalizing factors and 
observing that 



m k (B^) m k+1 (Ri +1 ) ™ ^ m k (R^) 

proves the claim (19. ip . Proceeding this way by downward induction, starting with 
the trivial case k = n, gives 

V R w < Cri w * (5 X1 ® 77 (2) ) • ■ ■ * (5^...^ ® rj (n)) (9.3) 
whenever s± < s 2 . . . < s n . The inequality (19. 3p then implies Lemma [9.31 □ 

We now turn to the proof of Theorem 19. 21 For each k, the maximal functions 
M k satisfying the usual weak-type and L p estimates on L p (G k ) (because the balls 
B^(sk) satisfy the required properties for the Vitali covering argument). Moreover, 
the vector- valued version 

||£>*c«) 2 ) 1/2 L (0k) < ^ikEiAf HL (0l) P.4) 

3 

also holds. This can be shown by following the main steps in the case of M. n (see e.g. 
[Ste93j . Chapter 2). In fact, one proves first a weak-type inequality for the vector- 
valued case, using a Calderon-Zygmund decomposition, which establishes ( 19. 4p for 



10 



Inequality (|9.1|) is essentially contained in Subsection 17.21 
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1 < p < 2. An additional argument is needed for p > 2, and is based on the fact that 



^ M k (f) (x)u(x)dx < A / \f(x)\ 2 (M k u)(x)dx 

for all positive functions u. Next a lifting argument (see the Appendix) allows one 
to lift flSU) on G k to G to get 

ll(Ei*tt)i 2 ) I/2 L, (0) ^ ^IIG>i 2 ) 1/2 L (0) » 

As a result we obtain a similar inequality for M„oM„_i o . . . oM±, and an application 
of Lemma 19.31 then proves Theorem 19.21 

Our actual application of the estimate in equation (19.41 ) is contained in the fol- 
lowing. 

Corollary 9.4. Suppose F t (x) is a measurable function of (t,x) G (!R + ) n xIR Ar . Then 



||( J (M(F t ))(x)) 2 dt) 1/2 \\ LP(RM) < A p \\{ J \F t (x)\ 2 dtf 2 \ 



I LP 



Proof. Assume first that F t (x) is jointly continuous and has compact support. For 
each e > 0, apply the conclusion (19.41 ) to the case where {fj{x)} are an enumer- 
ation of the e n / 2 F €ilj€i2j ... ein (x), for i 2 , . . . i n ) ranging over (Z + ) n , and then let 
e — > 0, obtaining the desired result in this case. For the general F t , assuming that 

II(/(r+)" l-^( x )| 2 ^) 1 ^ 2 |l LP ( G ) * s fi n he, find a sequence F^ n \x) of continuous functions 

of compact support, with F^ n \x) — > F t (x) almost everywhere, so that 



J(R+) n J(M+) n 



\D> 



and apply the previous case, via Fatou's lemma. □ 

It will also be useful to observe that effectively the estimate (I9.3P can be reversed 
in the following way. 

Lemma 9.5. We have 

V B w * (4i ® v B m) * ■■■ * {Sx!,..^-! ® V {n) ) < Crj m (9.5) 

s l s 2 (sn) cs* 

for an appropriate C > 0. Here s* = (s*, . . . s*), with sl = max{s-,-, 1 < j < k}. 
Note that we do not require that Si < s 2 ■ ■ ■ < s n . 

The proof is based on the observation that r\ (to * (5 Xk (g) r] R k+i) < cr]^^ where 



S 3 



max{sfc, Sj} if j > k, 
Sk if j = k. 
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In fact if x ^ (for some large C) then either |x^.| > C s k or x' ^ R~ . Looking 
back at the integral in equation (19.21) we see that the integral vanishes, because 
XB(s k )(xk ■ y') = in the first case, or x R ( k + 1 ) (y'^ 1 ■ x') = in the second case. 

Moreover as above, this integral is majorized by Cm(B^^). Altogether then, we 
have r] B (k) * (5 Xk <g) r] R {k+i) ) < c r] R (k) , and an induction proves Lemma 19.51 

Now let A s denote the function appearing on the left-side of (19.51) . Then as a 
consequence we have 

\(f*A s )(x)\<CM(f)(x), (9.6) 

for all s = (si,...,s n ), not necessarily in increasing order. Similarly if A* s = 
(8x1 ® ^o(n)) * ••• * (Vrw) we a l so have < cq„iy,. This follows from 

(5) if we observe that A*(x) = A s (x~ x ). As a result, in analogy to (19.61) . we have for 
all s 

\(f*A* s )(x)\ <cM(f)(x) (EBI) 

Indeed, one has R~* = R s * if, in defining R s , a coordinate system is used where the 
inverse of x = (x 1; . . . , x n ) is given by x~ x = (— x x , . . . , — x„). Alternatively, if we 
use cannonical coordinates of the second kind, as above, then one has R ClS * C -R^* 1 ^ 
R C2S *, for two appropriate constants c\ and c 2 ; this also leads to (19.61 ). 

9.2. Comparisons. 

The basic comparison function is 

V t (x) = t x ■ t 2 ■ ■ - ■ t n - n (ti + t 2 - ■ ■ + t k + JVi(x) • ■ • + iVfcOr))^- 1 

k=l 

for t = (tx, • • • ,t n ), tj > 0. Recall that N k (x) = Ixk] 1 ^ and Q k = ka k with the 
dimension of the Xk space. 

Theorem 9.6. 

sup |(/*r t )(x)| <CM(/)(x), (9.7) 
t 

where the supreme is taken over all t, with tj > 0. 

Proof. Note that it suffices to restrict attention to t's that are of acceptable size. 
Indeed, let Sj — t\ + t 2 ■ ■ ■ + tj, 1 < j < k. Then Sj < Sj + i but k(ti ■ ■ ■ + t k ) > 
S! + s 2 --- + s k . Hence (t x ■ ■ ■ + t k + N x + ■ ■ ■ + iV^)^" 1 < c k [s 1 + s 2 ■ ■ ■ + s k + JVi + 
••• + N k )^ ( ^ k ^ 1 , with c k = k® k+1 . Therefore T t (x) < cr s (x), which shows that it 
suffices to consider t's that are increasing. 

We next fix t = (tx, • ■ -t n ) and decompose the space G = M. N into a preliminary 
dyadic partition as follows. For each J = (ji, . . .j n ) G Z" we let 

Rj = {x£ R N I < N ^) + --- N ^) < jjfc for fc = i, 2j . . .„}, 
1 1 ti + 1 2 h t fc - J ' 
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with the understanding that if j k = the inequality should be taken to be 

Nijx) ■ ■ ■ + N k (x) < 1 
t\ + t 2 ■ ■ ■ tk 

Notice that IJjez™ = G gives a partitioning of the space G. However, in general 
each Rj is not comparable to an acceptable rectangle. We remedy this as follows. 
For a suitable constant c, define s J = (s{, ■ ■ ■ s£) by 

s{ = ct l 2 j \ s J 2 =cH 2 2 j \ ... s J n = c n t n 2 jn . (9.8) 

Now if Rj is non-empty, then since t\ < t 2 ■ ■ ■ < t n , we have 

Ni(x) ^h2 J \ N 1 (x) + N 2 (x) ^t 2 V\ ■■■ Nt(x) + N 2 (x) ■ ■ ■ + N n {x) ^t n 2^. 

As a result, for sufficiently large c, it follows that s{ < s 2 ■ ■ ■ < s J n . 

Now define Rj = {x : Nf.(x) < s{,k = l,...n} for those J where Rj is not 
empty. Then clearly Rj C Rj and each i?} is a rectangle of acceptable size (in 
fact, essentially the smallest rectangle of acceptable size containing Rj). However 
for / > 0, 

/ /(x ■ y^)r,(y)rfy = E / f^Y'^^dy. 

G J& 1 Rj 

Recall that by (JHSD, t k « s^2" Jfc , and on i?j we have iVi H N k « (t a + t 2 h 

4)2^ « s£. Thus, on i?j we have T t (y) < U (s()~ Qk ■ (2~ Qkjk ). So 

k=l 

f f{xy- 1 )Y t {y)dy<Y,^- Qk0k i4r Qk ■ [ fi^dy. 
I j k=l J*i 

But each Rj is a rectangle of acceptable size and n (sjO = cm(Rj). Thus by the 

fc=i 

definition of M, the last sums is majorized by cV n 2' Qkjk M(f) = c'M(f), and 

j k = l 

( 19. TP is proved. □ 

9.3. Truncated kernels. 

Recall that we defined truncated kernels and improved truncated kernels in Def- 
inition [73] (Section El]). Suppose that ip G C™(WL N ) with support in the unit ball. 
For b > write 

M*) = b- Qi - Qa - Q ^(b- 1 x. u r 2 x 2 , . . . r n x n ), 

the automorphically dilated ift. We also say that ipb has width b. 

Theorem 9.7. Suppose JC is a truncated flag kernel of width a, and ipt, is as above 
of width b. Then 

(1) K. * ipb and ipf, * K, are truncated kernels of width a + b. 
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(2) If in addition j G ifj(x) <ix = 0, then K * ipb and ipb* K are improved truncated 
kernels of width a + b. Moreover, then K * ^ and ipb * K are actually improved 
truncated kernels of width a + b, multiplied by the further factor 

Note that the statements of the hypotheses and conclusions have an automorphic- 
dilation invariance, so in proving Theorem 19. 7\ it suffices to consider two cases: b = 1, 
a < 1; and a = 1, b < 1. In the first case we use Proposition 17. 6[ since any truncated 
kernel is actually an un-truncated kernel. Thus we get that K * ipi and ipi * K have 
width 1, which is essentially the same as having width 1 + a, since a < 1. 

The second case could be easy if every truncated kernel of width 1 were of the 
form K * ipi. Its proof is a little more involved and requires the following lemma. 

Lemma 9.8. Given any M , there exist 7] and r]i both of class C^ M \ supported in 
the unit ball, and a (non- communicative) polynomial P(Xx, ■ ■ -Xn) = P{X) in the 
right-invariant vector fields of G, so that 

P(X)r ]o = 5 + r ]l . (9.9) 

with Sq the Dirac delta at the origin. 

N 

Proof. Consider the elliptic operator of order 2r, P(X) = {^X^ , with r a positive 

integer. Then by the standard theory of pseudo-differential operators there is a 
locally integrable function F which is C°° away from the origin, so that P(X)F = 
5o + rj', with 7]' a C 00 function. Moreover, F satisfies the estimate | (J^) -^(x)! < A a , 
whenever |x| < 1 and 2r > N + |a|. (These estimates also follow from [NRS 90J. 
Theorem 1.) Thus we only need to take 2r > M + N and set rjo — ji ■ F, where fi 
is a C°° function supported in the unit ball, and fi{x) = 1 in the ball of radius 1/2. 
Then since rjo is supported in the unit ball, so is rji = P(X)rj — <5 ; and since F is 
C°° away from the origin it follows that r]i is in fact C°° everywhere. This completes 
the proof of Lemma 19.81 □ 

We now return to the proof of Theorem 19.71 We consider K*tpb when K has width 
1, and b < 1. Now by the lemma K*ipb = K*5o*ijjb = K *P(X) *?7o*V ; & + K *rji*ijjb, 
since P{X) is a right-invariant differential operator. Now since K has width 1, 
K * P{X) is also a truncated kernel of width one, and in particular an un-truncated 
kernel. However, i]q * ipb has width 1 + 6, which is essentially one. Also, it is of class 
C( M ^ (uniformly in b), since t] is of class C^ M \ Thus K * P(X) * r/ * ip b satisfy 
the differential inequalities for a truncated kernel of width one for all orders < m. 
However, the term K * rji * ipb clearly does the same, for all orders. Notice we can 
make m as large as we wish by making M sufficiently large. (See Remark 17.71 on 
page |59~1) A similar argument works for ipb * X and thus part (1) of Theorem 19.71 is 
proved. 

Part (2) is proved in the same way, using conclusion (2) of Proposition 17.61 The 
further improvement given by the factor fe(a + 6) _1 comes about as follows. As before, 
we may take a = 1, and b < 1. Since J ^(x|dx = 0, both r] * ipb and rji * ipb give 
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an improvement of b. In fact, since J i()(x)dx = 0, it follows from Lemma 15.11 and 
Proposition 14.51 that we can write 

i, b = Y} k M^ ] ) (9-10) 

for suitable C°° functions ip^ supported in the unit ball, with {Xk] ranging over 
right-invariant vector fields of degree k , k > 1. Thus rjo * ipb = ^^& fe (?7o * Xk) * "06 
and this gives a gain b, b < 1. Similarly for the term 771 * ipb- 

9.4. Key estimates: kernels. 

Suppose cp( k > G Cq° is supported on the unit ball of the group Gk, with 



/ <^ (fc) (x)rfx = 0. 



We set (pt(x) = t Qk '" Qn ip(5 t -i(x)), with Q k = kak and let (p^> to be the corre- 
sponding distributions lifted to the full group G; i.e. fit = 5xi,x 2 ---x fe _ 1 ® V 3 ^ ■ We 
let $ t = $g * tpg (pfj for t = (*x, . . . , t n ), and write 

Recall the comparison function T t discussed in Section 19.21 Note that here we will 
allow the functions (p and $ to take their values in finite-dimensional vector spaces. 

Theorem 9.9. Suppose K, is a flag kernel. Then 

(1) \K * $t(x)| and |$J * -^( x )l are both majorized by cTt(x) for all t. 

(2) If X^ is any right-invariant vector field of degree k, then 

\X*{K*<$> t )\ < c {t l + --- + t k y k T t {x). 

(3) If X k is any left-invariant vector field of degree k, then 

\Xt(K *$*)!< c(h + ■■■ + t k )- k T t (x). 

For the proof we need to do our calculations in a particular coordinate system, 
already used in the proof Lemma [9731 Here we represent a point x = (xx, . . . , x n ) G G 
via adapted canonical coordinates of the second kind; i.e. we take x = exp(x' 1 • 
Xx) exp(x' 2 • X 2 ) ■ ■ ■ exp« ■ X n ), where x' = (x' k l , . . . , x' k>ak ), {X M , . . . , X k ,a k } is 
a basis of the sub-space of vector fields of degree k, and x' fe ■ X k = J2j x 'kj-^k,j- 
The passage from the initial (xx, . . . x n ) coordinates to the (x' 1; . . .x.' n ) coordinates 
is of the form treated in Section 16.51 so that the basic comparison function T t is 
essentially unchanged when passing from x to x'. We therefore freely use instead 
the new coordinate system x', and now relabel x' by x. The advantage of this 
coordinate system is that, firstly y = (yi, . . . y*., y^+i, • • • , y n ) G Gk if and only if 
yi = 0, . . . , yfc_i = 0, but more importantly, if x G G, and y G Gk, then x ■ y = 
(xi, ■ ■ ■ Xfc_x, Xfc, . . . x n ), with X£ ( for £ > k), depending only on x^, . . . x n , and 
y fc , . . . y n , and not on Xx, . . . x fc _x. 
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Now set K^\x) = K * (j$ (p® . Consider first K^(x) = K * tp®. Ac- 

cording to Proposition 17.61 P ar t (2), since f G ip^dx = 0, then for each x 1; the 
kernel K^(x.x, x 2 , . . . , x n ) as a function of (x 2 , . . . , x n ) on G 2 , is a truncated ker- 
nel of width t\ + Ni(xi), multiplied by the "constant" factor t\ \t\ + Aq(xi)] Ql \ 
Also we have a similar conclusion for d^K^(x), except now the improving factor is 

tl [tl + Mx!)]- 91 - 1 -*. 

Consider next the inductive hypothesis: for a given k, 

(a) For each xi, . . . x fc , K^ k \xi, . . . , x n ), thought of as a function of (x fc+1 , . . . , x n ) on 
Gk+i is a truncated kernel of width t\ + ■ ■ ■ + tk + A r 1 (x 1 ) • ■ ■ + JVfc(xfc), multiplied 

by the improving factor FJ t, [tx + ■ ■ ■ tj + Aq + ■ ■ ■ Nj] 3 . 

i=i 

(b) For each r < k, a similar statement holds for d^K^ k \x), except that now the 



r 



h part of the improving factor is t r \ti + • • ■ t r + Aq ■ ■ ■ + N r ] ' 1 r r . 



Notice that if the inductive hypothesis holds for k, that is for K^ k \ then since 
K( k+1 \xi ■ ■ ■ x n ) = * Vt* i > where the convolution is taken on the group Gk+i, 
therefore does not involve the variables x l5 . . . x^ because of the nature of our coor- 
dinate system. As a result, we get the conclusion for k + 1, that is for K^ k+1 \ To see 
this, we merely apply Proposition 17.61 part (2), for the case of the group Gk+\- 

More precisely, we are convolving a truncated kernel of width a — ti + ■ • • + + 
Aq + ■ — h Nk (on Gfc+i) with a function ip (which equals t Ptk+i > ) °f width b = t^+i- 
The result is a truncated kernel of width a + b — t\--- + t^+i + Aq ■ — h Nk on Gk+i, 
together with a further factor -^+. That is, together for K^ k+1 ^ we have, as a function 

Xk+2, ■ ■ - Xn, a truncated kernel of width t\ h tk+i + Aq + ■ ■ • Nk+i, times a factor 

of the form b(a + b)^ 1 . So the full improvement is 



b a + b t 



k+l 



a + b (a + b + A fc+1 )Q^i+ 1 (h ■ ■ • + + Ax • • • + A fe+ i) Q *+ 1+1 ' 

as was needed. The same kind of improvement holds for the estimates of d@f K^ k+1 \x), 
for £< k+l. 

Thus, the inductive hypothesis (now the conclusion of the induction) holds for 
k = n. As a result, it is clear that \K*Q t (x)\ < cT t (x) and \d Xk (K *<& t (x))\ < c\t\ + 

... + t k + N l --- + Nk] ~ k T t (x), for every k, 1 < k < n. Since X fc R = d Xk + Ei >k h i d x M 
where h k is a homogeneous polynomial of degree i — k, it follows, in particular, that 
X*{K * $ t )(x) < c [ti • • ■ + t k ] _i T t (x). 

The results for §* t * K and X^{^* * K) follow in the same way, but require a 
canonical coordinate system in the reverse order. Alternatively we can deduce it 
from the previous case by using the inversion x — > x _1 . 



9.5. Key estimates: operators. 
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We define P t (f) = / * $t, and P* = f * $*, with t = (ti,t 2 , •••*„) and ^ > 0, 
1 < j < n, with $ t and defined at the beginning of the previous section. We 
suppose /C is a flag kernel and Tf = f * K when / is a Schwartz function. We recall 
the maximal operator M and let M = M o M, i.e. M(f) = M(M(f)). 

Theorem 9.10. 

(a) \P t T(f)(x)\<cM(f)(x), allt. 

(a') Similarly \{T P*){f){x)\ < cM(f)(x), allt. 

(b) \P t TP*(f)(x)\ < j{s,t)M{f){x), where for some 5 > 0, 

i Sk tk x 

1{s, t) < c n min — , — 
\k=i \t k s k 

Note: the conclusion will be seen to hold for S = -K. 

Proof. The function P{Pf is given by (/ * K) * $ t = / * (K * $ t ). Hence conclusion 
(a) is a direct consequence of Theorem 19.61 and Theorem 19.91 part (1). The same is 
true for conclusion (a'). 

Turning to conclusion (b), we first fix k, and consider the situation when t^j ' Sk = 
p > 1. With this p given, we next divide our consideration in two cases. 

Case I: With a a positive constant, to be specified below, sj-i/sj > p a for at least 

one j, with 2 < j < k. 
Case II: Sj-i/sj < p a for all j, with 2 < j < k. 

To handle Case I we need the following observation that will give us the needed 
gain. Recall the notation rj R (k) = m/ c (i?p fc ' ) )~ 1 x R (fe) used above in Section |9~T1 



Lemma 9.11. If Sj~\/sj > p°~ , p > 1, th 



en 



1 4 ) * 



Proof of Lemma \9. 1 1\ In analogy with equation f 1 9 . 1 j) in Section |93| we have that 
on Gj 

^] =Y.^y x "^) ( 9 - n ) 

r>j 

where are left-invariant vector fields on Gj, with ip^ r > C°° functions supported in 
the unit ball. Now 



and X r R ^yg_iM is of the form sj^ipsj^. Combining these gives a sum ^~](sjf 

r>j 

Sj^i) r tl'^ * V's^j an d since Sj/sj-i < p~ a , the lemma is proved. □ 
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Consider the operator P* given by P*(f) = f * $* with $* = (pf} * (pi^Z^ •■■* </?sl ■ 
Because of Lemma 19.111 and since j > 2, we have |$*| < cp~ 2cr rj * ■ ■ ■ * fjnW = 

cp~ a ^ = cp~ 2a A*, in the terminology in Section |9~T1 So by equation (19.61 ) there, 
it follows that \P*(f)(x)\ < cp~ 2cr M(/)(x) Combining this with the first conclusion 
already proved that \P t T(F)(x)\ < cM(F)(x), with F = P*(f) yields 

\P t TP:(f)(x)\<cp- 2 °M(f) (9.12) 

since M = M ■ M. 

We now turn to Case II. Here Sj_i/sj < p a , for all j, 2 < j < k. Thus Sj < 

Skp a ( k ~^ for 1 < j < k. If we set = sup Sj, then < s^p "^ -1 ). Next we recall 

l<j<k 

the following fact: 

If Xf is a right-invariant vector field of degree i > j on Gj, then one can write 



where X,r are left-invariant vector fields of degree r. This follows by writing Xf = 
y ^ht tr {x)Xl' with hi jr a homogeneous polynomial of degree r — i and arguing as in 
Proposition 2.3. 

With this in hand, consider (p k Sk * <p*~} 1 ■■■* <psi ■ First use that j G tp^dx = 
0, which by the analogue of assertion (19.101) (for left-invariant vector fields on the 
group Gk) gives an expression involving the action of left-invariant vector fields (of 
degrees > k). Next pass from the left-invariant vector- fields acting on ipiy to the 
corresponding right-invariant vector fields acting on <p*~} 1 , via the rule 

{Xfy) * <p = i/; * X?<p. (9.13) 

At this point utilize the remark on page [HH to pass to left-invariant vector fields, and 
then use the rule (19. 13j) above to pass to <Ps~_ 2 2 , e ^ c - 

Putting this all together leads quickly to the following conclusion: the convolution 

is a finite sum of expression of the form 

W^ff*^;:?---*^?) ( 9 - 14 ) 

for r > k. Here we use the fact that s* > Sj, j < k. Now consider 

* ^"-i ' ■ * * &i *K*®t- (9.15) 

By applying the rule (I9.3P we can pass the left-invariant vector field X^ as a right- 
invariant vector field acting on K*§ t . We keep in mind that r > k, and use Theorem 
[9791 part (2). Therefore $* * K * $ t = <p1" } * ' ' ' * 

* * <^>i is majorized by 
a constant multiple of si ■ tZ r A* * T t where A* = fj R («) * ?j„ n -i ■ • • * ?y ( i) . 

s n— 1 s l 
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In view of Theorem 19.61 and inequality (19.61 ) in Section |9~T| we get that 

|P t TP;(/)(x)| < csup«0 M(M(/))(x). (9.16) 

r>fc 

We now pick a = 1/2/c. Since s* < Skp^ k ~ l \ and Sk/tk = p~ l , p > 1, we get 

|P t TP;(/)(x)| <cp^/ 2 M(M(/))(x). 
Combining this with the previous case given by (19.121) yields 

\p t TP:(f)( X )\<c [-*] ^(/)(x), ifv«*>i- 



By a parallel argument the analogous result holds of Sfc/tfe > 1- Hence 

•tk Sk ■ 

Since this holds for all k, 1 < k < n, we can take the geometric mean of these 
inequalities. The result is conclusion (2) of Theorem 19.101 with 5 = 1/n 2 . □ 



\P t TP:(f)(x)\ < c min (|*, **) *M (/)(*) 

\Tt Si./ 



9.6. Square functions and L p -boundedness. 

We will construct the square functions for G as products of the (one-parameter) 
square functions of the sub-groups Gk, 1 < k < n. Each G fc is a homogeneous group 
with family of dilations 6 r , and so there exists a finite-dimensional inner-product 
space Vk and a pair Lp^ k \ ip^ of V^-valued functions, with (p^ e C^(Gk) supported 
in the unit ball, and £ <S(Gk) a Schwartz function, so that J G <p( fc ) (x) rfx = 
/ Gfc V (fc) (x) rfx = 0, and 

/ ^ ) (xy" 1 )-^ ) (y)- = V (9.17) 
7o a 

Here <pi^(x) = a~ Q ' :_Qfe + 1 "' _Q ' I (p^^ ) (5 a -i(x)), with a similar definition for ipa (x). 
Also • denotes the inner product in V^. See |FS82j . Theorem 1.61. 

We define operators Pi and acting on functions on Gk, by setting Pa k \f) = 
f * <pi fc) and Qi fc) (/) = / * ^ { a ] . Note that fl9~T7D shows that 

r P W.QW* = l d . (9.18) 
Jo a 

Next, define the square functions Sf. and Sf by setting 



M/)(x) = (/|Pi fe) (/)(x)| 2 ^) 1/2 , 



oo 

sf (/)(*) = (/ig«(/)(x)i 2 ^) 1/2 . 
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The usual theory of singular integrals in |Ste93] and |FS82j together with (19 . 18[) then 
gives the inequalities 

II / Sk(f) \\l^\\ S*(f) \\ LP (9.19) 

for 1 < p < oo on Gk- The result is valid not only for scalar- valued /, but also for / 
that take their values in a Hilbert space. 

Having recalled the known results for Gk we transfer them to the whole group G 
by writing 



oo 

= ( / \pi k w d iy 12 stu) = (/iq«(/)i^) i/2 



We get as a consequence 



oo 

[p^-Qi k) - = id mm) 

J a 



on G, and 

II / ||l>(G) ~|| S k (f) ||lp(G)~|| S*(f) \\lp(G) ■ 

With the above one-parameter theory arising from each Gk we come to the square 
functions on G that are relevant for us. For each t = (ti,...t n ) e (IR + ) n we set 

P t = ■ ■ P^. That is, P t (f) = f * $ t , where $ t = * ^ ••■* # 

(k) (k) 

and (p tk = Oxi-xjb-i ® V 9 ^ • Note also that $ t is a V^-valued function, where V = 
V x ®V 2 ---®V n . Similarly, we define P* = Pg> • pf • • • • P£\ Qt = ■■■■ Q% , 
and Q* = Qg> • • • • q£>. Also Q t (/) = / * ^, with & = ^ * • • • • ^ and & is 
also ^/-valued. Finally, we set 

S(f)(x)= ( J \PtU)\ 2 j^)'\ ©(/)(*)=( / |MQt/| 2 |' V2 



Here we use the abbreviation that [i] = ii • t 2 ■ ■ ■ -t 
Lemma 9.12. We have 



(R+) n 

(b) || / || L p< A p (I || LP , 1 < p < oo; 

(c) || &(f) \\ LP < A p || / \\ LP; l<p<oo. 

To prove (a), we take first the identity ( 19. 18ft when k = 1, and a — t\. Next we 

(2) ~ (2) 

multiply on the left of both sides by PA/ and on the right of both sides by Q\ 2 
and integrate in t 2 , using ( 19.181 ) for k = 2. Continuing this way yields (a). The 
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inequality (b) follows from repeated comparisons of the corresponding inequalities 
(19.191 ) for S k . Also (c) follows by applying (19.191 ) for Sf and a two-fold application 
of the vector-valued maximal function in Corollary 19.41 for F t (x) = Qt(f)(x) ■ [t]" 1 ^ 2 . 
The final lemma needed is as follows. 

Lemma 9.13. Suppose K is a flag kernel end T(f) = f * JC. Then 

S(T(f))(x)<c&(f)(x). (9.20) 

Proof. Now P t T(f)(x) = L + >„ P t -TP*Q s (f)(x) ^? by part (a) of the previous lemma. 
Hence Theorem 19.101 part (b) shows that 

f ds 

\P t T(f)(x)\< J 7 (M)A4(g.(/))(:c)pp 

where j{s,t) = c(j]Li min (j, ^)) <5 - Thus 

\P t T(f)(x)\ 2 < [ ^s : t)(MQ s (f)(x)) 2 ^ ■ I(t) 
with I(t) = f j(s,t) H, by Schwarz's inequality. But sup I(t) = A < oo, since 

(K+) n * 

OO r tfc OO 

t fe 

A further integration in i (noting that also J/ K -n„ j(s, t) ^ < A) then gives the desired 
result. □ 

Theorem 9.14. With Tf = f * K as above, we have \\ Tf \\lp(g)< A v || / ||lp(g), 
1 < p < oo. 

This now follows directly from (4), once we apply Lemma 14.1 part (b), for Tf in 
place of /, and then part (c) of that lemma. 

10. Appendix I: Lifting 

Suppose that T(f) = f * K is a convolution operator on Gk with / G S{Gk) 
and JC G S'(Gk), a tempered distribution. Then Tf can be written T(/)(x) = 
f Gk /C(y) /(xy^ 1 ) dy = (/C, F x ), where F x is the element of S(G k ) given by F x (y) = 
/(xy _1 ) for y G We can lift T to a convolution operator on G, denoted by 
f, given by T(/)(x) = K(y)/(xy- 1 ) rfy = </C,F x ), where / G 5(G), and 

^x(y) = /(xy" 1 ) for y G G*. 

We describe this lifting in terms of the coordinate system used in the proof of 
Theorem 19.91 We can write each x G G as a product x = x' • x, with x' = 
(xi, . . . Xfc-i, 0, ... 0) and x G Gk, where x = (0, ... 0, x^, . . . x n ). With this coor- 
dinate system, we define K G S'(G) as 8 X > (g) K, where x = x' • x, and we set 
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f(/)(x) = (/*iO(x)_for / G S(G) and x G G. Then T(/)(x) = T(/)(x' ■ x). 
However observe that T(/)(x' • x) = T(/ x )(x), where / x is the element of S(Gk) 
given by / x '(y) = /(x' • y), y G G k . Therefore 

f(/)(x'-x) = T(f')(x) (10.1) 

Next, suppose that T satisfies the bound 

\\T(f) \\ LP(Gh) < A || / \\ LP(Gk) (10.2) 

for each / G S(G). Then applying this to / = via (110.11) . (and assuming p < oo), 
gives 

|T(/)(x'x)| p d2 < A p ^ \f x '(x)\ p dx , 

for each x', and an integration in x' yields 

II T(f) \\ LP{G) < A || / || LP(G) . (10.3) 

Suppose next that K depends on a parameter t, K = K t and set T t (f) = f * K t . 
There the same argument shows that 

|| sup \ f t {f)\ \\ LP(G) < A || / \\ LP{G) , for all / G S(G) 

t 

whenever 

|| sup \T t (f)\ \\ LP{Gk) < A || / \\ LP{Gh) , for all / G <S(G fc ) . 

i 

This proves that the lifted maximal inequality || M k (f) \\h>(g)< A p || / \\lp( G ) 
follows from the corresponding inequality on G k , by considering first the case when 
/ is non- negative, (and K t = r)$), and then by replacing / by |/|. 

In the same way the vector-valued maximal inequality (19. 4p on G k can be lifted 
to the corresponding inequality (19.41 ) on G. In fact, it suffices to prove (19.41 ) when 
there are only m non- zero f/s, 1 < j < m, with bounds independent of m. With 
this understanding, set 



/ = (A, . . . f m ), T t (f) = (^(A), • • • Z£(/m)), \f\ = (E l/i 

3=1 

m 1/2 

* = (*i. ..o, Ti 3 {f) = f*Ki p \T t (f)\ = (J2\ T U^\ 2 ) ■ 

i=i 

m 1/2 

Note that ( > sup |T/.(/)| 2 ) = sup |T t |(/)|. Then as before the inequality 



|| sup \T t {f)\ \\ LP(Gn) < A || |/| || LP(Gfe) , / G 5(G„) 

t 

implies the corresponding inequality for T t lifted to G, that is for T t , and this then 
yields the desired result. 
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We should remark that the lifting procedure used here can be viewed in terms of 
the more abstract "transference" method presented in [CW77] . 

11. Appendix II: An estimate for a geometric sum 

Recall that E n = {I = . . . ,i n ) ^7h n \i\ < i 2 < • • • < i n }- More generally, if 
< B x < B 2 < ■ ■ ■ < B n , let 

A{B) = {l={i 1 ,...,i n )ee n \B j <2 i * for l<j<n}. (11.1) 

We establish the following estimate for geometric sums which we shall use several 
times. 

Proposition 11.1. Let ctj > for 1 < j < n, and let M > Efc=i a fc- There 
is a constant C depending on n, on M , and on the numbers {ctj} so that for any 
Ai, . . . , A n G (0, oo) and any < B\ < B 2 < ■ ■ ■ < B n , 



i4l(l + ELi2^A fe ) 



<C ]J(A 1 + A 2 + --- + A j )-^, (11.2) 



T , —Tu < C TT(^i +A 2 + --- + Aj + Bj)-**. (11.3) 

JeA(B) [ l + 2^k=l 1 A k) j=l 

Proof. Note that if we take Bi — ■ ■ ■ — B n = 0, then the inequality (QXj3j) gives the 
inequality All .2j) . If for x±, . . . , x n > we put 



IL it 

(p(x lt ...,x n ) = (n^) ( x + 5^ xfcy4 

k=l k=l 

then if 1 < Sj < 2 for 1 < j < n, we have 



-M 



-M 



(sixi, . . . , s n x n ) < 2 |a| \\ x a k k (l + x * A k) ^ 2 |a| (^(a;i, ...,x n ) 



k=l k=l 

For each / = (h, ...,i n ) G Z n , let Qi = {x e R n \ \2r ik < x k < 2~ ifc }. It follows 
that there is a constant C = C(n, a) depending only on n and a such that 

c . x f d Xl ...d Xn< nLi(2-") afc r d Xl ..-dx n 

JQi ^■•■ X n ~ (1 + ELl 2 -^^) 

Thus 



Qi Xl-'-Xn ' (l + ELl 2 ^^) J Qi X l--' X n 

sr nLi(2- tfc ) Qfc < r f ,, dx 1 ---dx n 

IeA(B) { l + 2^k=l Z A k) 



X\ • • • x r 



U/gA(s) Qi 
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On the other hand, it is easy to check that if I £ A(JB) and x £ Qj, then < Xk+i < 
2xk for 1 < k < n — 1 and x^ < B^ 1 for 1 < k < n. Thus if we put 



\%k+i < %k < B k 1 for 1 < A; < n — 1 



x £ 



< x n < S" 1 



then UieA(B) — ^(-B)- Thus to prove the Proposition, it suffices to show that 

f Ql=l£? dX! ■ ■ ■ dx n < -pr + ... + A , B S-"I. 



However 



fc=i 



fc=i 



- M dxi ■ ■ ■ dx Ti 

X\ ' ' ' x n 



t M e- 1 



n(B) 



n 

k=l 



ak r -x k A k t d%l dXr, 
x k e 



X\ ■ ■ • X 



n 



dt 

T" 



We will show that we can estimate the inner integral on the right hand side by 



^) k=1 



X k e 



k -x k A k t ' ' ' dx n 



•^1 ' ' ' <En 



;ii.4) 



< C(n, a) + t- a ^)(A l +A 2 + --- + Aj + B 5 ) 

7=1 



and since M > X^=i a ki this will complete the proof. 

To establish (II 1.4ft . we first establish an estimate for fx s a e~ sAt — . On the one 

2 S 

hand, we have 



B~ 



s e 



ds ^ 
— < e ^ 
s 



,a-l 



ds = oT x e-^ xAt B- a . 



On the other hand we have 



f d<i r°° 

/ s » e ~sAt ™ < ^ M ya / s ^ e ^ds < C a {At)~ a e~\ xM 
J% s JlxAt 



Putting the two together, we have the estimate 



s a e~ sAt 



ds 



< C a e—* xAt (At + B)- a < C a e—* xAt {A + B)~ a (l + t~ a ). (11.5) 
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We now establish All .4|) by induction on n. When n — 1, we use (111.51) with x = 
to get 



x l e 



X\ 



For the induction step, we have 

n 

n 



a k -xuAa dx\ * ' ' dx n 



n(B) 



k=l 



X\ ' ' ' x n 



™(B) k=2 



B7 1 



n -axA lt dx l] dx 2 ---dx n 



x 2 ■ ■ ■ x n 



Q'(B) 



n 

k=2 



r a k c -x k A k t c -\x 2 Ait dX2 ' ' " dx n 

Jk x 2 ■ ■ ■ X n 



Here Q'(B) 



x e 



and we have 



\%k+i < %k < -B fe 1 for 2 < k < n — 1 

< X n < B- 1 

used the estimate in All. 5j) . The last integral on the right-hand side is thus of the 
same form as the original integral, except that n has been replaced by n — 1, and A 2 
has been replaced by A 2 + \A\. We can thus use our inductive hypothesis on this 
integral, and we obtain the desired estimate. This completes the proof. □ 
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